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EXCERPTS FROM THE PREFACES 
TO THE FIRST AND SECOND EDITIONS 


THs book is devoted to the presentation of the theory of the electromagnetic and gravita- 
tional fields. i.e. electrodynamics and general relativity. A complete, logically connected 
theory of the electromagnetic field includes the special theory of relativity, so the latter 
has been taken as the basis of the presentation. As the starting point of the derivation of 
the fundamental relations we take the variational principles, which make possible the 
attainment of maximum generality, unity and simplicity of presentation. 

in accordance with the overall plan of our Course of Theoretical Physics (of which this 
book is a part). we have not considered questions concerning the electrodynamics of con- 
tinuous media, but restricted the discussion to "microscopic electrodynamics" —the 
electrodynamics of point charges in vacuo. 

The reader is assumed to be familiar with electromagnetic phenomena as discussed in 
general physics courses. A knowledge of vector analysis is also necessary. The reader is 
not assumed to have any previous knowledge of tensor analysis, which is presented in 
parallel with the development of ihe theory of gravitational fields. 


Moscow. December 1939 
Moscow, June 1947 L. LANDAU, E. LIFSHITZ 


PREFACE TO THE 
FOURTH ENGLISH EDITION 


THE first edition of this book appeared more than thirty years ago. In the course of reissues 
over these decades the book has been revised and expanded; its volume has almost doubled 
since the first edition. But at no time has there been any need to change the method proposed 
by Landau for developing the theory, or his style of presentation, whose main feature was 
a striving for clarity and simplicity. I have made every effort to preserve this style in the 
revisions that I have had to make on my own. 

As compared with the preceding edition, the first nine chapters, devoted to electrodyna- 
mics, have remained almost without changes. The chapters concerning the theory of the 
gravitational field have been revised and expanded. The material in these chapters has 
increased from edition to edition, and it was finally necessary to redistribute and rearrange 
It. 

I should like to express here my deep gratitude to all of my helpers in this work—too 
many to be enumerated-—who, by their comments and advice, helped me to eliminate 
errors and introduce improvements. Without their advice, without the willingness to help 
which has met all my requests, the work to continue the editions of this course would have 
been much more difficult. A special debt of gratitude is due to L. P. Pitaevskii, with whom 
I have constantly discussed all the vexing questions. 

The English translation of the book was done from the last Russian edition, which 
appeared in 1973. No further changes in the book have been made. The 1994 corrected 
reprint includes the changes made by E. M. Lifshitz in the Seventh Russian Edition 
published in 1987. 

I should also like to use this occasion to sincerely thank: Prof. Hamermesh, who has 
translated this book in all its editions, starting with the first English edition in 1951. The 
success of this book among English-speaking readers is to a large extent the result of his 
labor and careful attention. 

E. M. LiFSHITZ 


PUBLISHER'S NOTE 


As with the other volumes in the Course of Theoretical Physics, the authors do not, as a 
rule, give references to original papers, but simply name their authors (with dates). Full 
bibliographic references are only given to works which contain matters not fully expounded 
in the text. 


EDITOR’S PREFACE TO THE 
SEVENTH RUSSIAN EDITION 


E. M. Lifshitz began to prepare a new edition of Teoria Polia in 1985 and continued his 
work on it even in hospital during the period of his last illness. The changes that he 
proposed are made in the present edition. Of these we should mention some revision of 
the proof of the law of conservation of angular momentum in relativistic mechanics, and 
also a more detailed discussion of the question of symmetry of the Christoffel symbols in 
the theory of gravitation. The sign has been changed in the definition of the electro- 
magnetic field stress tensor. (In the present edition this tensor was defined differently than 
in the other volumes of the Course.) 


June 1987 L. P. PITAEVSKII 
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NOTATION 


Three-dimensional quantities 


Three-dimensional tensor indices are denoted by Greek letters 
Element of volume, area and length: dV, df, dl 

Momentum and energy of a particle: p and & 

Hamiltonian function: #7 

Scalar and vector potentials of the electromagnetic field: $ and A 
Electric and magnetic field intensities: E and H 

Charge and current density: p and j 

Electric dipole moment: d 

Magnetic dipole moment: m 


Four-dimensional quantities 


Four-dimensional tensor indices are denoted. by Latin letters i, k, /,... and take on the 
values 0, 1, 2, 3 
We use the metric with signature (+ — — —) 


Rule for raising and lowering indices—see p. 14 

Components of four-vectors are enumerated in the form A! = (A9, A) 

Antisymmetric unit tensor of rank four is e'*", where e?!?* = 1 (for the definition, see 
p. 17) 

Element of four-volume dQ = dx?dx!dx^dx? 

Element of hypersurface dS! (defined on pp. 19-20) 

Radius four-vector: x! = (ct, r) 

Velocity four-vector: wu! = dx'/ds 

Momentum four-vector: p = (&/c, p) 

Current four-vector: j' = (cp, pv) 

Four-potential of the electromagnetic field: 4! = ($, A) 

0A, 0A, 

0x Ox 
Fx to the components of E and H, see p. 61) 

Energy-momentum four-tensor 7* (for the definition of its components, see p. 78) 


Electromagnetic field four-tensor Fy, = (for the relation of the components of 
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CHAPTER 1 


THE PRINCIPLE OF RELATIVITY 


§ 1. Velocity of propagation of interaction 


For the description of processes taking place in nature, one must have a system of 
reference. By a system of reference we understand a system of coordinates serving to indicate 
the position of a particle in space, as well as clocks fixed in this system serving to indicate 
the time. 

There exist systems of reference in which a freely moving body, i.e. a moving body which 
is not acted upon by external forces, proceeds with constant velocity. Such reference systems 
are said to be inertial. 

If two reference systems move uniformly relative to each other, and if one of them is an 
inertial system, then clearly the other is also inertial (in this system too every free motion will 
be linear and uniform). In this way one can obtain arbitrarily many inertial systems of 
reference, moving uniformly relative to one another. 

Experiment shows that the so-called principle of relativity is valid. According to this 
principle all the laws of nature are identical in all inertial systems of reference. In other 
words, the equations expressing the laws of nature are invariant with respect to transforma- 
tions of coordinates and time from one inertial system to another. This means that the 
equation describing any law of nature, when written in terms of coordinates and time in 
different inertial reference systems, has one and the same form. 

The interaction of material particles is described in ordinary mechanics by means of a 
potential energy of interaction, which appears as a function of the coordinates of the inter- 
acting particles. It is easy to see that this manner of describing interactions contains the 
assumption of instantaneous propagation of interactions. For the forces exerted on each 
of the particles by the other particles at a particular instant of time depend, according to this 
description, only on the positions of the particles at this one instant. A change in the position 
of any of the interacting particles influences the other particles immediately. 

However, experiment shows that instantaneous interactions do not exist in nature. Thus a 
mechanics based on the assumption of instantaneous propagation of interactions contains 
within itself a certain inaccuracy. In actuality, if any change takes place in one of the inter- 
acting bodies, it will infiuence the other bodies only after the lapse of a certain interval of 
time. It is only after this time interval that processes caused by the initial change begin to 
take place in the second body. Dividing the distance between the two bodies by this time 
interval, we obtain the velocity of propagation of the interaction. 

We note that this velocity should, strictly speaking, be called the maximum velocity of 
propagation of interaction. It determines only that interval of time after which a change 
occurring in one body begins to manifest itself in another. It is clear that the existence of a 
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maximum velocity of propagation of interactions implies, at the same time, that motions of 
bodies with greater velocity than this arc in general impossible in nature. For if such a motion 
could occur, then by means of it one could realize an interaction with a velocity exceeding 
the maximum possible velocity of propagation of interactions. 

Interactions propagating from one particle to another are frequently called “signals”, 
sent out from the first particle and "informing" the second particle of changes which the 
first has experiencec. The velocity of propagation of interaction is then referred to as the 
signal velocity. 

From the principle of relativity it follows in particular that the velocity of propagation 
of interactions is the same in all inertial systems of reference. Thus the velocity of propaga- 
tion of interactions is a universal constant. This constant velocity (as we shall show later) is 
also the velocity of light in empty space. The velocity of light is usually designated by the 
letter c, and its numerical value is 


c = 2.998 x 10'° cm/sec. (1.1) 


The large value of this velocity explains the fact that in practice classical mechanics 
appears to be sufficiently accurate in most cases. The velocities with which we have occasion 
to deal are usually so small compared with the velocity of light that the assumption that the 
latter is infinite does not materially affect the accuracy of the results. 

The combination of the principle of relativity with the finiteness of the velocity of propaga- 
tion of interactions is called the principle of relativity of Einstein (it was formulated by 
Einstein in 1905) in contrast to the principle of relativity of Galileo, which was based on an 
infinite velocity of propagation of interactions. 

The-mechanics based on the Einsteinian principle of relativity (we shall usually refer to it 
simply as the principle of relativity) is called re/ativistic. In the limiting case when the 
velocities of the moving bodies are small compared with the velocity of light we can neglect 
the effect on the motion of the finiteness of the velocity of propagation. Then relativistic 
mechanics goes over into the usual mechanics, based on the assumption of instantaneous 
propagation of interactions; this mechanics is called Newtonian or classical. The limiting 
transition from relativistic to classical mechanics can be produced formally by the transition 
to the limit c > œ in the formulas of relativistic mechanics. 

In classical mechanics distance is already relative, i.e. the spatial relations between 
different events depend on the system of reference in which they are described. The state- 
ment that two nonsimultaneous events occur at one and the same point in space or, in 
general, at a definite distance from each other, acquires a meaning only when we indicate the 
system of reference which is used. 

On the other hand, time is absolute in classical mechanics; in other words, the properties 
of time are assumed to be independent of the system of reference; there is one time for all 
reference frames. This means that if any two phenomena occur simultaneously for any one 
observer, then they occur simultaneously also for all others. In general, the interval of time 
between two given events must be identical for all systems of reference. 

It is easy to show, however, that the idea of an absolute time is in complete contradiction 
to the Einstein principle of relativity. For this it is sufficient to recall that in classical 
mechanics, based on the concept of an absolute time, a general law of combination of 
velocities is valid, according to which the velocity of a composite motion is simply equal to 
the (vector) sum of the velocities which constitute this motion. This law, being universal, 
should also be applicable to the propagation of interactions. From this it would follow 


§ 2 VELOCITY OF PROPAGATION OF INTERACTION 3 


that the velocity of propagation must be different in different inertial systems of reference, 
in contradiction to the principle of relativity. In this matter experiment completely confirms 
the principle of relativity. Measurements first performed by Michelson (1881) showed 
complete lack of dependence of the velocity of light on its direction of propagation; whereas 
according to classical mechanics the velocity of light should be smaller in the direction of the 
earth’s motion than in the opposite direction. 

Thus the principle of relativity leads to the result that time is not absolute. Time elapses 
differently in different systems of reference. Consequcntly the statement that a definite time 
interval has elapsed between two given events acquires meaning only when the reference 
frame to which this statement applies is indicated. In particular, events which are simul- 
taneous in one reference frame will not be simultaneous in other frames. 

To clarify this, it is instructive to consider the following simple example. 

Let us look at two inertial reference systems K and K’ with coordinate axes X YZ and 
X' Y' Z' respectively, where the system K' moves relative to K along the X(X’) axis (Fig. 1). 


B-—A-C 


Fic. 1. 


Suppose signals start out from some point A on the X’ axis in two opposite directions. 
Since the velocity of propagation of a signal in the K' system, as in all inertial systems, is 
equal (for both directions) to c, the signals will reach points B and C, equidistant from A, 
at one and the same time (in the K' system) 

But it is easy to see that the same two events (arrival of the signal at B and C) can by 
no means be simultaneous for an observer in the K system. In fact, the velocity of a signal 
relative to the K system has, according to the principle of relativity, the same value c, 
and since the point B moves (relative to the K system) toward the source of its signal, 
while the point C moves in the direction away from the signal (sent from A to C), in the K 
system the signal will reach point B earlier than point C. 

Thus the principle of relativity of Einstein introduces very drastic and fundamental 
changes in basic physical concepts. The notions of space and time derived by us from our 
daily experiences are only approximations linked to the fact that in daily life we happen to 
deal only with velocities which are very small compared with the velocity of light. 


$ 2. Intervals 


In what follows we shall frequently use the concept of an event. An event is described by 
the place where it occurred and the time when it occurred. Thus an event occurring in a 
certain material particle is defined by the three coordinates of that particle and the time 
when the event occurs. 

It is frequently useful for reasons of presentation to use a fictitious four-dimensional 
space, on the axes of which are marked three space coordinates and the time. In this space 
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events are represented by points, called world points. In this fictitious four-dimensional space 
there corresponds to each particle a certain line, called a world line. The points of this line 
determine the coordinates of the particle at all moments of time. It is easy to show that to a 
particle in uniform rectilinear motion there corresponds a straight world line. 

We now express the principle of the invariance of the velocity of light in mathematical 
form. For this purpose we consider two reference systems K and K' moving relative to each 
other with constant velocity. We choose the coordinate axes so that the axes X and X" 
coincide, while the Y and Z axes are parallel to Y' and Z’; we designate the time in the 
systems K and K' by t and r’. 

Let the first event consist of sending out a signal, propagating with light velocity, from a 
point having coordinates x, y, z, in the K system, at time f, in this system. We observe the 
propagation of this signal in the K system. Let the second event consist of the arrival of the 
signal at point x; y2z; at the moment of time t}. The signal propagates with velocity c; 
the distance covered by it is therefore c(t, — t2). On the other hand, this same distance 
equals [(x;—xj)! - (y; —y,)?-(z2—z,)]*. Thus we can write the following relation 
between the coordinates of the two events in the K system: 

(x2= x) t (3—- y1) * (2; 7 2)? - e(( — 0) = 0. Q.1) 

The same two events, i.e. the propagation of the signal, can be observed from the K’ 
system: 

Let the coordinates of the first event in the K’ system be x, yz, 1j, and of the second: 


X5 Y2 2215. Since the velocity of light is the same in the K and K’ systems, we have, similarly 
to (2.1): 


Gi x)! *02- Y)! * 63-7 2)! - (5 1)? = 0. (2.2) 
If x, y, z, f; and x2 yzz;t; are the coordinates of any two events, then the quantity 
$12 — [c (t; 1? (65 7x) 703 7 3) G0 T (2.3) 


is called the interval between these two events. 

Thus it follows from the principle of invariance of the velocity of light that if the interval 
between two events is zero in one coordinate system, then it is equal to zero in all other 
systems. 

If two events are infinitely close to each other, then the interval ds between them is 

ds? = cdit? — dx? — dy? — dz*. (2.4) 

The form of expressions (2.3) and (2.4) permits us to regard the interval, from the formal 
point of view, as the distance between two points in a fictitious four-dimensional space 
(whose axes are labelled by x, y, z, and the product ct). But there is a basic difference 
between the rule for forming this quantity and the rule in ordinary geometry: in forming the 
square of the interval, the squares of the coordinate differences along the different axes are 
summed, not with the same sign, but rather with varying signs.f 

As already shown, if ds = 0 in one inertial system, then ds’ = 0 in any other system. On 
the other hand, ds and ds’ are infinitesimals of the same order. From these two conditions 
it follows that ds? and ds’? must be proportional to each other: 


ds? = ads"? 
where the coefficient a can depend only on the absolute value of the relative velocity of the 


t The four-dimensional geometry described by the quadratic form (2.4) was introduced by H. Minkowski, 
in connection with the theory of relativity. This geometry is called pseudo-euclidean, in contrast to ordinary 
euclidean geometry. 
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two inertial systems. It cannot depend on the coordinates or the time, since then different 
points in space and different moments in time would not be equivalent, which would be in 
contradiction to the homogeneity of space and time. Similarly, it cannot depend on the 
direction of the relative velocity, since that would contradict the isotropy of space. 

Let us consider three reference systems K, K,, K,, and let V, and V, be the velocities of 
systems K, and K, relative to K. We then have: 


ds? =a(V,)ds?, — ds? = a(Vy)ds3. 


Similarly we can write 
ds? = a(V,a)ds5, 


where V, ; is the absolute value of the velocity of K, relative to K,. Comparing these relations 
with one another, we find that we must have 
a(V,) 
av^ «h^» (2.5) 
But V, depends not only on the absolute values of the vectors V, and V2, but also on the 
angle between them. However. this angle does not appear on the left side of formula (2.5). 
It is therefore clear that this formula can be correct only if the function a(V) reduces to a 
constant, which is equal to unity according to this same formula. 
Thus, 


ds? = ds", (2.6) 


and from the equality of the infinitesimal intervals there follows the equality of finite 
intervals: s = s’. 

Thus we arrive at a very important result: the interval between two events is the same in all 
inertial systems of reference, ie. it is invariant under transformation from one inertial 
system to any other. This invariance is the mathematical expression of the constancy of the 
velocity of light. 

Again let x,y,2,7, and x,y22;1; be the coordinates of two events in a certain 
reference system K. Does there exist a coordinate system K', in which these two events 
occur at one and the same point in space? 

We introduce the notation 


I5 — [a = tha, (ax) Gor) Gi = Mp. 


Then the interval between events in the K system is: 


and in the K’ system 
sh scm ls 
whereupon, because of the invariance of intervals, 
ctia = yh, 
We want the two events to occur at the same point in the K’ system, that is, we require 
12 = 0. Then 
siccus. 

Consequently a system of reference with the required property exists if sł, > 0, that is, if 
the interval between the two events is a real number. Real intervals are said to be timelike. 

Thus, if the interval between two events is timelike, then there exists a system of reference 
in which the two events occur at one and the same place. The time which elapses between 
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the two events in this system is 


i ck fe SS 
fi, veta lis i (2.7) 

If two events occur in one and the same body, then the interval between them is always 
timelike, for the distance which the body moves between the two events cannot be greater 
than cf, ,, since the velocity of the body cannot exceed c. So we have always 


l2 «€ ety. 


Let us now ask whether or not we can find a system of reference in which the 
two events occur at one and the same time. As before, we have for the K and K' systems 
c2, P, = thla. We want to have fj; = 0, so that 

5$ = -n < 0. 

Consequently the required system can be found only for the case when the interval s;; 
between the two events is an imaginary number. Imaginary intervals are said to be spacelike. 

Thus if the interval between two events is spacelike, there exists a reference system in 


which the two events occur simultaneously. The distance between the points where the 
events occur in this system is 


w= VBen, = d5,5. (2.8) 

The division of intervals into space- and timelike intervals is, because of their invariance, 
an absolute concept. This means that the timelike or spacelike character of an interval is 
independent of the reference system. 

Let us take some event O as our origin of time and space coordinates. In other words, in 
the four-dimensional system of coordinates, the axes of which are marked x, y, z, t, the 
world point of the event O is the origin of coordinates. Let us now consider what relation 
other events bear to the given event O. For visualization, we shall consider only one space 
dimension and the time, marking them on two axes (Fig. 2). Uniform rectilinear motion of a 
particle, passing through x = 0 at t = 0, is represented by a straight line going through O 
and inclined to the ¢ axis at an angle whose tangent is the velocity of the particle. Since the 
maximum possible velocity is c, there is a maximum angle which this line can subtend with 
the ¢ axis. In Fig. 2 are shown the two lines representing the propagation of two signals 


a | C 
Absolute 
future 


Absolutely 
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Separated —— 


Abso:ute X 
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(with the velocity of light) in opposite directions passing through the event O (i.e. going 
through x = 0 at t= 0). All lines representing the motion of particles can lie only in the 
regions aOc and dOb. On the lines ab and cd, x = ct. First consider events whose world 
points lie within the region aOc. It is easy to show that for all the points of this region 
c^t? — x? > 0. In other words, the interval between any event in this region and the event O 
is timelike. In this region f > 0, i.e. all the events in this region occur "after" the event O. 
But two events which are separated by a timelike interval cannot occur simultaneously in 
any reference system. Consequently it is impossible to find a reference system in which any 
of the events in region aOc occurred "before" the event O, i.e. at time t <0. Thus all the 
events in region aOc are future events relative to O in all reference systems. Therefore this 
region can be called the absolute future relative to O. 

In exactly the same way, all events in the region bOd are in the absolute past relative to O; 
i.e. events in this region occur before the event O in all systems of reference. 

Next consider regions dOa and cOb. The interval between any event in this region and 
the event O is spacelike. These events occur at different points in space in every reference 
system. Therefore these regions can be said to be absolutely remote relative to O. However, 
the concepts "simultaneous", “earlier”, and “later” are relative for these regions. For any 
event in these regions there exist systems of reference in which it occurs after the event 
O, systems in which it occurs earlier than O, and finally one reference system in which it 
occurs simultaneously with O. 

Note that if we consider all three space coordinates instead of just one, then instead of 
the two intersecting lines of Fig. 2 we would have a “cone” x? y? -- z2 — c?t? = 0 in the 
four-dimensional coordinate system x, y, z, t, the axis of the cone coinciding with the / axis. 
(This cone is called the /ight cone.) The regions of absolute future and absolute past are then 
represented by the two interior portions of this cone. 

Two events can be related causally to each other only if the interval between them is 
timelike; this follows immediately from the fact that no interaction can propagate with a 
velocity greater than the velocity of light. As we have just seen, it is precisely for these events 
that the concepts "earlier" and "later" have an absolute significance, which is a necessary 
condition for the concepts of cause and effect to have meaning. 


§ 3. Proper time 


Suppose that in a certain inertial reference system we observe clocks which are moving 
relative to us in an arbitrary manner. At each different moment of time this motion can be 
considered as uniform. Thus at each moment of time we can introduce a coordinate system 
rigidly linked to the moving clocks, which with the clocks constitutes an inertial reference 
system. 

In the course of an infinitesimal time interval dt (as read by a clock in our rest frame) the 
moving clocks go a distance Vdx? 4-dy? +dz’. Let us ask what time interval dt’ is indicated 
for this period by the moving clocks. In a system of coordinates linked to the moving 
clocks, the latter are at rest, i.e., dx’ = dy’ = dz’ = 0. Because of the invariance of intervals 


ds? = c?di? — dx? - dy? - dz? = c?dt"^, 


; dx! e dy! e dz? 


from which 
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But 
dx*+dy?+dz° | 2 
di? Tu 


where v is the velocity of the moving clocks; therefore 


1 2 
petoa i, (3.1) 
C c 


Integrating this expression, we can obtain the time interval indicated by the moving clocks 
when the elapsed time according to a clock at rest is t; —1,: 


12 RM 
, + v? 
== | dt f1- (3.2) 


The time read by a clock moving with a given object is called the proper time for this object. 
Formulas (3.1) and (3.2) express the proper time in terms of the time for a system of reference 
from which the motion is observed. 

As we see from (3.1) or (3.2), the proper time of a moving object is always less than the 
corresponding interval in the rest system. In other words, moving clocks go more slowly 
than those at rest. 

Suppose some clocks are moving in uniform rectilinear motion relative to an inertial 
system K. A reference frame K’ linked to the latter is also inertial. Then from the point of 
view of an observer in the K system the clocks in the K' system fall behind. And con- 
versely, from the point of view of the K' system, the clocks in K lag. To convince ourselves 
that there is no contradiction, let us note the following. In order to establish that the clccks 
in the K' system lag behind those in the K system, we must proceed in the following fashion. 
Suppose that at a certain moment the clock in K' passes by the clock in K, and at that 
moment the readings of the two clocks coincide. To compare the rates of the two clocks in 
K and K' we must once more compare the readings of the same moving clock in K' with the 
clocks in K. But now we compare this clock with different clocks in K— with those past 
which the clock in K’ goes at this new time. Then we find that the clock in K’ lags behind the 
clocks in K with which it is being compared. We see that to compare the rates of clocks in 
two reference frames we require several clocks in one frame and one in the other, and that 
therefore this process is not symmetric with respect to the two systems. The clock that appears 
to lag is always the one which is being compared with different clocks in the other 
system. 

If we have two clocks, one of which describes a closed path returning to the starting point 
(the position of the clock which remained at rest), then clearly the moving clock appears to 
lag relative to the one at rest. The converse reasoning, in which the moving clock would be 
considered to be at rest (and vice versa) is now impossible, since the clock describing a 
closed trajectory does not carry out a uniform rectilinear motion, so that a coordinate 
system linked to it will not be inertial. 

Since the laws of nature are the same only for inertial reference frames, the frames linked 
to the clock at rest (inertial frame) and to the moving clock (non-inertial) have different 
properties, and the argument which leads to the result that the clock at rest must lag is not 
valid. 
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The time interval read by a clock is equal to the integral 
b 
l [as 
[4 
a 
taken along the world line of the clock. 1f the clock is at rest then its world line is clearly a 
line parallel to the ¢ axis; if the clock carries out a nonuniform motion in a closed path and 
returns to its starting point, then its world line will be a curve passing through the two points, 
on the straight world line of a clock at rest, corresponding to the beginning and end of the 
motion. On the other hand, we saw that the clock at rest always indicates a greater time 
interval than the moving one. Thus we arrive at the result that the integral 
b 
| ds, 
a 
taken between a given pair of world points, has its maximum value if it is taken along the 
straight world line joining these two points.t 


$ 4. The Lorentz transformation 


Our purpose is now to obtain the formula of transformation from one inertial reference 
system to another, that is. a formula by means of which, knowing the coordinates x, y, Z, t, 
of a certain event in the K system, we can find the coordinates x’, y’, z', t' of the same event 
in another inertial system K’. 

In classica! mechanics this question is resoived very simply. Because of the absolute 
nature of time we there have t = t’; if, furthermore, the coordinate axes are chosen as usual 
(axes X, X' coincident, Y, Z axes parallel to Y', Z', motion along X, X^) then the co- 
ordinates y, z clearly are equal to 1”, z’, while the coordinates x and x’ differ by the distance 
traversed by one system relative to the other. If the time origin is chosen as the moment when 
the two coordinate systems coincide, and if the velocity of the K' system relative to K is V, 
then this distance is Vt. Thus 


L 


x=x'+Vt, yey, z=7', =f. (4.1) 


This formula is called the Galileo transformation. It is easy to verify that this transformation, 
as was to be expected, does not satisfy the requirements of the theory of relativity; it does 
not leave the interval between events invariant. 

We shall obtain the relativistic transformation precisely as a consequence of the require- 
ment that it leave the interval between events invariant. 

As we saw in § 2, the interval between events can be looked on as the distance between the 
corresponding pair of world points in a four-dimensional system of coordinates. Conse- 
quently we may say that the required transformation must leave unchanged all distances in 
the four-dimensional x, y, z, cf, space. But such transformations consist only of parallel 
displacements, and rotations of the coordinate system. Of these the displacement of the co- 
ordinate system parallel to itself is of no interest, since it leads only to a shift in the origin 
of the space coordinates and a change in the time reference point. Thus the required trans- 


+ It is assumed, of course, that the points a and b and the curves joining them are such that all elements ds 
along the curves are timelike. 

This property of the integral is connected with the pseudo-euclidean character of the four-dimensional 
geometry. In euclidean space the integral would, of course, be a minimum along the straight line. 
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formation must be expressible mathematically as a rotation of the four-dimensional 
X, ¥, Z, ct, coordinate system. 

Every rotation in the four-dimensional space can be resolved into six rotations, in the 
planes xy, zy, xz, tx, fy, tz (just as every rotation in ordinary space can be resolved into three 
rotations in the planes xy, zy, and xz). The first three of these rotations transform only the 
space coordinates; they correspond to the usual space rotations. 

Let us consider a rotation in the tx plane; under this, the y and z coordinates do not 
change. In particular, this transformation must leave unchanged the difference (ct)? — x?, 
the square of the “‘distance” of the point (ct, x) from the origin. The relation between the 
old and the new coordinates is given in most general form by the formulas: 

x = x' cosh y 4 ct' sinh y, ct = x' sinh y 4 ct' cosh y, (4.2) 
where y is the “angle of rotation"; a simple check shows that in fact c?t? — x? = c?t? — x"?, 
Formula (4.2) differs from the usual formulas for transformation under rotation of the co- 
ordinate axes in having hyperbolic functions in place of trigonometric functions. This is the 
difference between pseudo-euclidean and euclidean geometry. 

We try to find the formula of transformation from an inertial reference frame K to a 
system K' moving relative to K with velocity V along the x axis. In this case clearly only the 
coordinate x and the time ! are subject to change. Therefore this transformation must have 
the form (4.2). Now it remains only to determine the angle y, which can depend only on the 
relative velocity V.t 

Let us consider the motion, in the K system, of the origin of the K' system. Then x’ = 0 
and formulas (4.2) take the form: 


x — ct' sinh y, ct — ct' cosh y, 
or dividing one by the other, 


x 
— = tanh y. 
ct 


But x/t is clearly the velocity V of the K' system relative to K. So 


tanh y = — 
From this 
V 
c l 
sinh y = -, cosh y = 5: 
V 
1- — 1- — 
J c? J c? 
Substituting in (4.2), we find: 
x Vt ; itor 
x=," yy, z=7', (= o—————— (4.3) 


y? M V? 
Ji-5 Js 
c c 


This is the required transformation formula. It is called the Lorentz transformation, and is of 
fundamental importance for what follows. 


t Note that to avoid confusion we shall always use V to signify the constant relative velocity of two 
inertial systems, and v for the velocity of a moving particle, not necessarily constant. 
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The inverse formulas, expressing x’, y’, 2’, t' in terms of x, y, Z, t, are most easily obtained 
by changing V to — V (since the K system moves with velocity — V relative to the K' 
system). The same formulas can be obtained directly by solving equations (4.3) for x’, y’, z', t'. 

It is easy to see from (4.3) that on making the transition to the limit c — oo and classical 
mechanics, the formula for the Lorentz transformation actually goes over into the Galileo 
transformation. 

For V > cin formula (4.3) the coordinates x, ¢ are imaginary; this corresponds to the fact 
that motion with a velocity greater than the velocity of light is impossible. Moreover, one 
cannot use a reference system moving with the velocity of light—in that case the 
denominators in (4.3) would go to zero. 

For velocities V small compared with the velocity of light, we can use in place of (4.3) 
the approximate formulas: 


V 
x=x'+Vr, yey, z=2', t=+5x'. (4.4) 
c 


Suppose there is a rod at rest in the K system, parallel to the X axis. Let its length, 
measured in this system, be Ax = x; — x, (x, and x, are the coordinates of the two ends of 
the rod in the K system). We now determine the length of this rod as measured in the K’ 
system. To do this we must find the coordinates of the two ends of the rod (x; and x) in 
this system at one and the same time ¢’. From (4.3) we find: 


xj Vt xt Vr 


TP "a 
— — 
[4 c 


The length of the rod in the K’ system is Ax’ = x;— x; subtracting x, from x;, we find 
Ax' 


aa 
{—-— 
c 
The proper length of a rod is its length in a reference system in which it is at rest. Let us 
denote it by /j = Ax, and the length of the rod in any other reference frame K’ by /. Then 


V? 
id 1-5. (4.5) 


Thus a rod has its greatest length in the reference system in which it is at rest. Its length 


in a system in which it moves with velocity V is decreased by the factor V1—V2/c?. This 
result of the theory of relativity is called the Lorentz contraction. 

Since the transverse dimensions do not change because of its motion, the volume Y^ of a 
body decreases according to the similar formula 


y? 
Y - Y^. Ji-% (4.6) 


where %, is the proper volume of the body. 

From the Lorentz transformation we can obtain anew the results already known to us 
concerning the proper time (§ 3). Suppose a clock to be at rest in the K’ system. We take 
two events occurring at one and the same point x’, y’, z' in space in the K' system. The time 
between these events in the K’ system is At’ = t5—1,. Now we find the time At which 


Ax = 
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elapses between these two events in the K system. From (4.3), we have 


t^ + V + t + V + 
—X zx 
1 c 2 c? 


or, subtracting one from the other, 


Ar 
t;—t, =At= ny 


V? 
J l= 
in complete agreement with (3.1). 

Finally we mention another general property of Lorentz transformations which distin- 
guishes them from Galilean transformations. The latter have the general property of com- 
mutativity, i.e. the combined result of two successive Galilean transformations (with 
different velocities V, and V,) does not depend on the order in which the transformations 
are performed. On the other hand, the result of two successive Lorentz transformations does 
depend, in general, on their order. This is already apparent purely mathematically from our 
formal description of these transformations as rotations of the four-dimensional coordinate 
system: we know that the result of two rotations (about different axes) depends on the order 
in which they are carried out. The sole exception is the case of transformations with parallel 
vectors V, and V, (which are equivalent to two rotations of the four-dimensional coordinate 
system about the same axis). 


$ 5. Transformation of velocities 


In the preceding section we obtained formulas which enable us to find from the coordinates 
of an event in one reference frame, the coordinates of the same event in a second reference 
frame. Now we find formulas relating the velocity of a material particle in one reference 
system to its velocity in a second reference system. 

Let us suppose once again that the K' system moves relative to the K system with velocity 
V along the x axis. Let v, = dx/dt be the component of the particle velocity in the K system 
and v; = dx’/dt’ the velocity component of the same particle in the K’ system. From (4.3), 
we have 


de, dr 


we find 


(5.1) 
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These formulas determine the transformation of velocities. They describe the law of com- 
position of velocities in the theory of relativity. In the limiting case of c > oo, they go over 
into the formulas v, = v, + V, v, = tp v, = v; of classical mechanics. 

In the special case of motion of a particle parallel to the X axis, v, = v, v, =v, = 0. 
Then v, = v; = 0, v, = v', so that 


v= c + V i (5.2) 


1+ RA 
p x 
c? 


It is easy to convince oneself that the sum of two velocities each smaller than the velocity 
of light is again not greater than the light velocity. 

For a velocity V significantly smaller than the velocity of light (the velocity v can be 
arbitrary). we have approximately, to terms of order V/c: 


; vy jets pon ce A 
ve =v,+Vt1- =], D, = UQ— UU, 5, U: = UQ— 0,0; 5. 
c c 
These three formulas can be written as a single vector formula 
t I , , 
v=w+V—-3(V-v')v’. (5.3) 
c 


We may point out that in the relativisticelaw of addition of velocities (5.1) the two 
velocities v' and V which are combined enter unsymmetrically (provided they are not both 
directed along the x axis). This fact is related to the noncommutativity of Lorentz trans- 
formations which we mentioned in the preceding Section. 

Let us choose our coordinate axes so that the velocity of the particle at the given moment 
lies in the XY plane. Then the velocity of the particle in the K system has components 
v, = v cos 0, c, = v sin 0, and in the K’ system v, = v' cos 0’, v, = v' sin 0’ (t, v', 0, 0' are 
the absolute values and the angles subtended with the X, X' axes respectively in the K, K’ 
systems). With the help of formula (5.1), we then find 


V? 
v NMEDET. 
c 


= ccs 5.4 
ay v cos 0’+V on 


This formula describes the change in the direction of the velocity on transforming from 
one reference system to another. 

Let us consider a very important special case of this formula, namely, the deviation of 
light in transforming to a new reference system—a phenomenon known as the aberration 
of light. In this case t = v' = c, so that the preceding formula goes over into 


ian 0 = —— sin b. (5.5) 
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From the same transformation formulas (5.1) it is easy to obtain for sin 0 and cos 0: 


y Nu 
1—-—-; cos 0' + — 
c c 


sin 9 = — —————— sin l’, cos 0 = —————. (5.6) 
V V 
1 4- — cos 0' 1+ — cos 0' 
C C 


In case V < c, we find from this formula, correct to terms of order V/c: 
x ` " V : 1 D 
sin @—sin 0’ = — — sin 0' cos @’. 
c 


Introducing the angle A0 = 0' —0 (the aberration angle), we find to the same order of 
accuracy 


V 
A0 = — sin 6’, (5.7) 
c 


which is the well-known elementary formula for the aberration of light. 


§ 6. Four-vectors 


The coordinates of an event (ct, x, y, z) can be considered as the components of a four- 
dimensional radius vector (or, for short, a four-radius vector) in a four-dimensional space. 
We shall denote its components by x', where the index i takes on the values 0, 1, 2, 3, and 


x°=ct, xi=x, x=y, x? =z. 


The square of the "length" of the radius four-vector is given by 
po eoo 
It does not change under any rotations of the four-dimensional coordinate system, in 
particular under Lorentz transformations. 
In general a set of four quantities A°, 4!, 4?, A? which transform like the components 
of the radius four-vector x' under transformations of the four-dimensional coordinate 
system is called a four-dimensional vector (four-vector) A'. Under Lorentz transformations, 


V 
A'94 — A! Als — A'9 

c 
200452 ——, AA? AHA (61) 


J4-5 Ji-4 
c? PI 


The square magnitude of any four-vector is defined analogously to the square of the radius 
four-vector: 


(45 — (4! - (?y - (y. 
For convenience of notation, we introduce two *'types" of components of four-vectors, 
denoting them by the symbols A‘ and Aj, with superscripts and subscripts. These are related 
by 
Ag =A, A, — —A!, A,= — A’, 4, = — A3. (6.2) 

The quantities A‘ are called the contravariant, and the A; the covariant components of the 
four-vector. The square of the four-vector then appears in the form 

3 

Y. AA; = AA, +AA, + A2 A54 + AA. 


i-o 
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Such sums are customarily written simply as 4'4;, omitting the summation sign. One 
agrees that one sums over any repeated index, and omits the summation sign. Of the pair 
of indices, one must be a superscript and the other a subscript. This convention for sum- 
mation over “dummy” indices is very convenient and considerably simplifies the writing of 
formulas. 

We shall use Latin letters i, k, /,..., for four-dimensional indices, taking on the values 
0, 1, 2; 3. 

In analogy to the square of a four-vector, one forms the scalar product of two different 
four-vectors: 

AB, = A°B,+A'B, + A? B, AB,. 


It is clear that this can be written either as A'B, or 4,;B'—the result is the same. In general 
one can switch upper and lower indices in any pair of dummy indices. 

The product A'B; is a four-scalar—it is invariant under rotations of the four-dimensional 
coordinate system. This is easily verified directly,i but it is also apparent beforehand (from 
the analogy with the square 4/4;) from the fact that all four-vectors transform according to 
the same rule. 

The component A? is called the time component, and A! , A?, A? the space components of 
the four-vector (in analogy to the radius four-vector). The square of a four-vector can be 
positive, negative, or zero; such vectors are called, timelike, spacelike, and null-rectors, 
respectively (again in analogy to the terminology for intervals).§ 

Under purely spatial rotations (i.e. transformations not affecting the time axis) the three 
space components of the four-vector A‘ form a three-dimensional vector A. The time 
component of the four-vector is a three-dimensional scalar (with respect to these trans- 
formations). In enumerating the components of a four-vector, we shall often write them as 


A! = (49, A). 


The covariant components of the same four-vector are A; = (49, — A), and the square of 
the four-vector is 4/4; — (49)? — A?. Thus, for the radius four-vector: 


x! = (ct, r), x; — (ct, —r), xix, = c? - r2, 


For three-dimensional vectors (with coordinates x, y, z) there is no need to distinguish 
between contra- and covariant components. Whenever this can be done without causing 
confusion, we shall write their components as A, (x = x, y, z) using Greek letters for sub- 
scripts. In particular we shall assume a summation over x, y, z for any repeated index (for 
example, A: B = A, B,). i 

A four-dimensional tensor (four-tensor) of the second rank is a set of sixteen quantities 
A'*, which under coordinate transformations transform like the products of components of 
two four-vectors. We similarly define four-tensors of higher rank. 


+ In the literature the indices are often omitted on four-vectors, and their squares and scalar products are 
written as 42, AB. We shall not use this notation in the present text. 

1 Oneshould remember that the law for transformation of a four-vector expressed in covariant components 
differs (in signs) from the same law expressed for contravariant components. Thus, instead of (6.1), one will 
have: 


$ Null vectors are also said to be isotropic. 
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The components of a second-rank tensor can be written in three forms: covariant, Aix 
contravariant, A“, and mixed, Aj, (where, in the last case, one should distinguish between 
A', and Aj", i.e. one should be careful about which of the two is superscript and which a 
subscript). The connection between the different types of components is determined from 
the general rule: raising or lowering a space index (1, 2, 3) changes the sign of the com- 
ponent, while raising or lowering the time index (0) does not. Thus: 


00 01 11 
Agog =A, Áo = 24", An ZA ,.., 
0 00 vi 01 0 Ot 1 11 
Ay =A”, Ag =A”, AU. = A", A =-A",.... 
Under purely spatial transformations, the nine quantities 4!!, A'*,... form a three- 


tensor. The three components 4°!', 4??, 4?? and the threc components 4'°, A?°, A%° 


constitute three-dimensional vectors. while the component A9? is a three-dimensional 
scalar. 


A tensor A" is said to be symmetric if A'* = A“, and antisymmetric if A* = ~A™. In an: 
antisymmetric tensor, all the diagonal components (i.e. the components 499, A!!,...) 
are zero, since, for example, we must have 499 = — 499. For a symmetric tensor A^, the 


mixed components A’, and A, obviously coincide; in such cases we shall simply write Ai, 
putting the indices one above the other. 

In every tensor equation, the two sides must contain identical and identically placed 
(i.e. above or below) free indices (as distinguished from dummy indices). The free indices in 
tensor equations can be shifted up or down, but this must be done simultaneously in all 
terms in the equation. Equating covariant and contravariant components of different 
tensors is "illegal" ; such an equation, even if it happened by chance to be valid in a particular 
reference system, would be violated on going to another frame. 

From the tensor components A* one can form a scalar by taking the sum 

Ai, = A99 A! E AA, 
(where, of course, Aj! = A'j). This sum is called the trace of the tensor. and the operation for 
obtaining it ts called contraction. 

The formation of the scalar product of two vectors, considered earlier, is a contraction 
operation: it is the formation of the scalar .4‘B; from the tensor .4‘B,. In general, contracting 
‘on any pair of indices reduces the rank of the tensor by 2. For example, 4; is a tensor of 
second rank A’, BY is a four-vector, A'*;, is a scalar, etc. 

The unit four-tensor ô$ satisfies the condition that for any four-vector A’, 


It is clear that the components of this tensor are 
l, if i=k 
ój-4 . 6.4 
t o if ixk (6.4) 


Its trace is ôi = 4. 

By raising the one index or lowering the other in ô}, we can obtain the contra- or covariant 
tensor g or gy, which is called the metric tensor. The tensors e^ and gą have identical 
components, which can be written as a matrix: 


1 0 0 0 

iky [9 -! 0 0 
(g^) = (gi) = 0 0 =i 0 (6.5) 

0 0 0 -1 
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(the index i labels the rows, and k the columns, in the order 0, 1, 2, 3). It is clear that 


9A" = Aj, g" A, = Al (6.6) 
The scalar product of two four-vectors can therefore be written in the form: 
A! A; = gu AiA* = 8" A,A,. (6.7) 


The tensors di, g g are special in the sense that their components are the same in all 
coordinate systems. The completely antisymmetric unit tensor of fourth rank, e"", has the 
same property. This is the tensor whose components change sign under interchange of any 
pair of indices, and whose nonzero components are +1. From the antisymmetry it follows 
that all components in which two indices are the same are zero, so that the only non- 
vanishing components are those for which all four indices are different. We set 


g23 = 41 (6.8) 


(hence @9123 = — 1). Then all the other nonvanishing components e" are equal to +1 or 


— 1, according as the numbers i, k, /, m can be brought to the arrangement 0, 1, 2, 3 by an 
even or an odd number of transpositions. The number of such components is 4! = 24. Thus, 


eme dm = — 24. (6.9) 


With respect to rotations of the coordinate system, the quantities e^" behave like the 
components of a tensor; but if we change the sign of one or three of the coordinates the 
components e", being defined as the same in all coordinate systems, do not change, whereas 
some of the components of a tensor should change sign. Thus e* is, strictly speaking, 
not a tensor, but rather a pseudotensor. Pseudotensors of any rank, in particular 
pseudoscalars, behave like tensors under all coordinate transformations except those that 
cannot be reduced to rotations, i.e. reflections, which are changes in sign of the coordinates 
that are not reducible to a rotation. 

The products e'"e?"* form a four-tensor of rank 8, which is a true tensor; by contracting 
on one or more pairs of indices, one obtains tensors of rank 6, 4, and 2. All these tensors 
have the same form in all coordinate systems. Thus their components must be expressed as 
combinations of products of components of the unit tensor ói—the only true tensor whose 
components are the same in all coordinate systems. These combinations can easily be found 
by starting from the symmetries that they must possess under permutation of indices. 

If A* is an antisymmetric tensor, the tensor A“ and the pseudotensor A** = Je" 4,,, 
are said to be dual to one another. Similarly, e*"4,, is an antisymmetric pseudotensor of 
rank 3, dual to the vector A’. The product A 4f of dual tensors is obviously a pseudoscalar. 


T For reference we give the following formulas: 
ój ót ói Ot 


, { t ost 
ó* 5" ó* OOK lo; 6, o: i. 
etme = — |S D | Orem s OS 0; OF 
o ð, ô; ‘ | PX Ò c 
m m m m r T S, 
op àv OF Òr, 
eme im = —2(6; ór — ðið; } Oeps - 65). 


The overall coefficient in these formulas can be checked using the resuh of a complete contraction, which 
should give (6.9). 
As a consequence of these formulas we have: 


e Ain Arr Ais Ami © AC pn 
eme Ai Air Ais Am = 4A, 


where 4 is the determinant formed from the quantities Ax. 
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In this connection we note some analogous properties of three-dimensional vectors and 
tensors. The completely antisymmetric unit pseudotensor of rank 3 is the set of quantities 
€,5, Which change sign under any transposition of a pair of indices. The only nonvanishing 
components of e,,, are those with three different indices. We set e,,, = 1; the others are 1 
or — 1, depending on whether the sequence a, f, y can be brought to the order x, y, z by an 
even or an odd number of transpositions.t 

The products e,5,e;,, form a true three-dimensional tensor of rank 6, and are therefore 
expressible as combinations of products of components of the unit three-tensor ó,,.1 

Under a reflection of the coordinate system, i.e. under a change in sign of all the co- 
ordinates, the components of an ordinary vector also change sign. Such vectors are said to 
be polar. The components of a vector that can be written as the cross product of two polar 
vectors do not change sign under inversion. Such vectors are said to be axial. The scalar 
product of a polar and an axial vector is not a true scalar, but rather a pseudoscalar: it 
changes sign under a coordinate inversion. An axial vector is a pseudovector, dual to some 
antisymmetric tensor. Thus, if C = A x B, then 


Ca = lej,Cs, where Cp, = A&B,— A,B,. 

Now consider four-tensors. The space components (i, k, — 1, 2, 3) of the antisymmetric 
tensor A form a three-dimensional antisymmetric tensor with respect to purely spatial 
transformations; according to our statement its components can be expressed in terms of 
the components of a three-dimensional axial vector. With respect to these same trans- 
formations the components A°!, 49?, 49? form a three-dimensional polar vector. Thus the 
components of an antisymmetric four-tensor can be written as a matrix: 


0 p P Ek 


— Px 0 -a, a 

a= a 0 d (6.10) 
y z x 
=P: —ay, ay 0, 


where, with respect to spatial transformations, p and a are polar and axial vectors, re- 
spectively. In enumerating the components of an antisymmetric four-tensor, we shall write 
them in the form 


A* = (p,a); 
then the covariant components of the same tensor are 


Ax = (—p, a). 
Finally we consider certain differential and integral operations of four-dimensional tensor 
analysis. 


+ The fact that the components of the four-tensor e™'™ are unchanged under rotations of the four- 
dimensional coordinate system, and that the components of the three-tensor €«sy are unchanged bv rotations 
of the space axes are special cases of a general rule: any completely antisymmetric tensor of rank equal to 
the number of dimensions of the space in which it is defined is invariant under rotations of the coordinate 
system in the space. 

1 For reference, we give the appropriate formulas: 

DL Oau bay 

Ona Seu Ós, . 
LAM Oyu Oy 

Contracting this tensor on one, two and three pairs of indices, we get: 


€apy € any = 


CapyCiny = Oak Ós, — ax Opa, 
€s5y Cty = 2a, 
Easy Cay = 6. 
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The four-gradient of a scalar $ is the four-vector 

od C 0ó v 

ax! Nc at’ $): 
We must remember that these derivatives are to be regarded as the covariant components 
of the four-vector. In fact, the differential of the scalar 

[7 
d$ = as dx 

is also a scalar; from its form (scalar product of two four-vectors) our assertion is obvious. 

In general, the operators of differentiation with respect to the coordinates x', 2/éx', 
should be regarded as the covariant components of the operator four-vector. Thus, for 
example, the divergence of a four-vector, the expression 0A'/0x', in which we differentiate 
the contravariant components Z', is a scalar.t 

In three-dimensional space one can extend integrals over a volume, a surface or a curve. 
In four-dimensional space there are four types of integrations: 

(1) Integral over a curve in four-space. The element of integration is the iine element, i.e. 
the four-vector dx’. 

(2) Integral over a (two-dimensional) surface in four-space. As we know, in three- 
space the projections of the area of the parallelogram formed from the vectors dr 
and dr on the coordinate planes x,x, are dx,dx,—dx,dx,. Analogously, in four- 
space the infinitesimal element of surface is given by the antisymmetric tensor of 
second rank df* = dx'dx'* — dx*dx'; its components are the projections of the element of 
area on the coordinate planes. In three-dimensional space, as we know, one uses as surface 
element in place of the tensor df,, the vector df, dual to the tensor df,,: df, = 1e,5, df;,. 
Geometrically this is a vector normal to the surface element and equal in absolute mag- 
nitude to the area of the element. In four-space we cannot construct such a vector, but we 
can construct the tensor d/ *'* dual to the tensor df“, 


df ** = Leds... (6.11) 


Geometrically it describes an element of surface equal to and "normal" to the element of 
surface df; all segments lying in it are orthogonal to all segments in the element df". 
- It is obvious that df *df& = 0. 
(3) Integral over a hypersurface, i.e. over a three-dimensional manifold. In three- 
dimensional space the volume of the parallelepiped spanned by three vectors is equal to the 


+ If we differentiate with respect to the “covariant coordinates" x,, then the derivatives 


d$ 1d¢ 
-= C Es -vé) 
2x, cot 
form the contravariant components of a four-vector. We shall use this form only in exceptional cases [for 
example, for writing the square of the four-gradient (24/23*)/(29/ ^x]. 
We note that in the literature partial derivatives with respect to the coordinates are often abbreviated 
using the symbols 
d 2 
aee p ES 
2X, ax! 
In this form of writing of the differentiation operators, the co- or contravariant character of quantities formed 
with them is explicit. This same advantage exists for another abbreviated form for writing derivatives, 
using the index preceded by a comma: 


20 THE PRINCIPLE OF RELATIVITY § 6 


determinant of the third rank formed from the components of the vectors. One obtains 
analogously the projections of the volume of the parallelepiped (i.e. the “areas” of the 
hypersurface) spanned by three four-vectors dx', dx", dx"; they are given by the deter- 
minants 


idx! dx" dx" 
dS —dx* dx'" dx'*, 
idx! dx" dx" 


which form a tensor of rank 3, antisymmetric in all three indices. As element of integration 
over the hypersurface, it is more convenient to use the four-vector dS', dual to the tensor 
as": 
dS' = — fe" dS ims Skim = Énkim dS". (6.12) 
Here 
dS! eds. dstesds ues 


Geometrically dS‘ is a four-vector equal in magnitude to the "areas" of the hypersurface 
element, and normal to this element (i.e. perpendicular to all lines drawn in the hyper- 
surface element). In particular, d$? = dx dy dz, i.e. it is the element of three-dimensional 
volume dV, the projection of the hypersurface element on the hyperplane x? = const. 

(4) Integral over a four-dimensional volume; the element of integration is the scalar 


dQ = dx"dx!dx? dx? = cdtdV. (6.13) 
The element is a scalar: it is obvious that the volume of a portion of four-space is un- 
changed by a rotation of the coordinate system.t 
Analogous to the theorems of Gauss and Stokes in three-dimensional vector analysis, 
there are theorems that enable us to transform four-dimensional integrals. 
The integral over a closed hypersurface can be transformed into an integral over the four- 
volume contained within it by replacing the element of integration dS; by the operator 


dS; ^ dQ =... (6.14) 
ôx 
For example, for the integral of a vector A‘ we have: 
; ðA' 
$ A'dS; = [ dO. (6.15) 
J Ox 


This formula is the generalization of Gauss' theorem. 
An integral over a two-dimensional surface is transformed into an integral over the hyper- 
surface "spanning" it by replacing the element of integration df/ž by the operator 


à ð 
dfă dS; =, — dS. 5 (6.16) 


+ Under a transformation from the integration variables x°, x', x?, x? to new variables x°, x'!, x'?, x, 


the element of integration changes to J d£, where dY — dx’? dx'! dx’? dx? 


0€(x'8, x, x'?, x3) 


€(x?, x! , x?, x?) 
is the Jacobian of the transformation. For a linear transformation of the form x“ == ajx*, the Jacobian J 


coincides with the determinant ja/| and is equal to unity for rotations of the coordinate systern; this shows 
the invariance of dQ. 
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For example, for the integral of an antisymmetric tensor A* we have: 


1 1 0A* ðA” QA* 
: [ 2 - 5 [ (as, A IS i EE fas% - (6.17) 


The integral over a four-dimensional closed curve is transformed into an integral over the 
surface spanning it by the substitution: 


; . 0 
dx! 2 df" EN (6.18) 
Thus for the integral of a vector, we have: 


d xi Ci = ik 8A, _ 04i 
$us = Í d Ax >| df e 34) (6.19) 


which is the generalization of Stokes' theorem. 


PROBLEMS 


1, Find the law of transformation of thc components of a symmetric four-tensor A under 
Lorentz transformations (6.1). 


Solution: Considering the components of the tensor as products of components of two four- 
vectors, we get: 


Ave = ae (4°. 2 K go E a), AM = d (as 25] V eiu p a), 
ve c € [4 [^ 


and analogous formulas for 4??, 4!? and 4°”. 
2. The same for the antisymmetric tensor 4*, 


Solution: Since the coordinates x? and x? do not change, the tensor component 4?? does not 
change, while the components A?*, A‘? and A??, 49? transform like x! and x°: 


A7? i- K go A? Vana 
A = A73. AY LI ee , A9? = c 


EZ FF yi 
pns Jie 
and similarly for 4!?, 49?, 


With respect to rotations of the two-dimensional coordinate system in the plane x?x! (which are 
the transformations we are considering) the components 49?! = — 4!?, 499 = 41! — 0, form an 
antisymmetric of tensor of rank two, equal to the number of dimensions of the space. Thus (see the 
remark on p. 18) these componenis are not changed by the transformations: 


A9! = A}, 
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§ 7. Four-dimensional velocity 


From the ordinary three-dimensional velocity vector one can form a four-vector. This 
four-dimensional velocity ( four-velocity) of a particle is the vector 
(Ld (7.1) 
uz. . 
ds 


To find its components, we note that according to (3.1), 


—5 
as= ett i Ss 
c 


where v is the ordinary three-dimensional velocity of the particle. Thus 


etc. Thus 


i-[—l X (7.2) 


v v? 
J-i eis 


Note that tbe four-velocity is a dimensionless quantity. 
The components of the four-velocity are not independent. Noting that dx; dx! = ds?, we 
have 


ulu, — 1. (7.3) 


Geometrically, u' is a unit four-vector tangent to the world line of the particle. 
Similarly to the definition of the four-velocity, the second derivative 


dx! dy! 
T d? ds 
may be called the four-acceleration. Differentiating formula (7.3), we find: 
uw! = 0, (7.4) 


i.e. the four-vectors of velocity and acceleration are "mutually perpendicular". 


PROBLEM 


Determine the relativistic uniformly accelerated motion, i.e. the rectilinear motion for which the 
acceleration w in the proper reference frame (at each instant of time) remains constant. 

Solution: In the reference frame in which the particle velocity is v — 0, the components of the 
four-acceleration w! = (0, w/c?, 0, 0) (where w is the ordinary three-dimensional acceleration, 
which is directed along the x axis). The relativistically invariant condition for uniform acceleration 
must be expressed by the constancy of the four-scalar which coincides with w? in the proper reference 
frame: 

w2 


w'w, = const = — —. 
c 
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In the “fixed” frame, with respect to which the motion is observed, writing out the expression for 
w'w, gives the equation 


d v v 
w Or — Á = wt+const. 


d P3. 2 
h-5 1-5 
c c 
Setting v = 0 for t = 0, we find that const = 0, so that 
wt 
m 
Jes 


Integrating once more and setting x = 0 for r =0, we find: 


ce? vip 
:-z( l| +— -1), 
w c 


For wt «£c, these formulas go over the classical expressions v = wt, x = wt?/2. For wt — co, the 
velocity tends toward the constant value c. 
The proper time of a uniformly accelerated particle is given by the integral 


v= 


As t— co, it increases much more slowly than ¢, according to the law c/w In (2w?/c). 


CHAPTER 2 


RELATIVISTIC MECHANICS 


§ 8. The principle of least action 


In studying the motion of material particles, we shall start from the Principle of Least 
Action. The principle of least action is defined, as we know, by the statement that for each 
mechanical system there exists a certain integral S, called the action, which has a minimum 
value for the actual motion, so that its variation ôS is zero.t 

To determine the action integral for a free material particle (a particle not under the 
influence of any external force), we note that this integral must not depend on our choice of 
reference system, that is, it must be invariant under Lorentz transformations. Then it follows 
that it must depend on a scalar. Furthermore, it is clear that the integrand must be a dif- 
ferential of the first order. But the only scalar of this kind that one can construct for a free 
particle is the interval ds, or «ds, where « is some constant. So for a free particle the action 
must have the form 


b 
S = —a | ds, 


where ,{° is an integral along the world line of the particle between the two particular events 
of the arrival of the particle at the initial position and at the final position at definite times 
t, and f,, i.e. between two given world points; and « is some constant characterizing the 
particle. It is easy to see that « must be a positive quantity for all particles. In fact, as we 
saw in § 3, ,[^ds has its maximum value along a straight world line; by integrating along 
a curved world line we can make the integral arbitrarily small. Thus the integral a? ds with 
the positive sign cannot have a minimum; with the opposite sign it clearly has a minimum, 
along the straight world line. 
The action integral can be represented as an integral with respect to the time 
t2 
S= Í Lat. 

ti 

The coefficient L of dt represents the Lagrange function of the mechanical system. With the 


aid of (3.1), we find: 
2 
2 
S=- fac |1- 5ds 
c 
ti 


t Strictly speaking, the principle of least action asserts that the integral S must be a minimum only for 
infinitesimal lengths of the path of integration. For paths of arbitrary length we can say only that S must be 
an extremum, not necessarily a minimum. (See Mechanics, § 2.) 
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where v is the velocity of the material particle. Consequently the Lagrangian for the particle 
iS 
L = -acV1—v?J/c?. 

The quantity x, as already mentioned, characterizes the particle. In classical mechanics 
each particle is characterized by its mass m. Let us find the relation between « and m. It can 
be determined from the fact that in the limit as ¢ — oo, our expression for L must go over 
into the classical expression L — mv?/2. To carry out this transition we expand L in powers 
of v/c. Then, neglecting terms of higher order, we find 


L=-ac ,/l-+ 2 -act =~. 


Constant terms in the Lagrangian do not affect the equation of motion and can te 
omitted. Omitting the constant ac in L and comparing with the classical expression 
L = mv?/2, we find that a = mc. 


Thus the action for a free material point is 
b 


| ds (8.1) 


S 2 -mce 


and the Lagrangian is 


L= -~me I- (8.2) 


§ 9. Energy and momentum 


By the momentum of a particle we can mean the vector p = OL /ðv (OL /ðv is the symbolic 
representation of the vector whose components are the derivatives of L with respect to the 
corresponding components of v). Using (8.2), we find: 


niv 
p-———. (9.1) 


m 
For small velocities (v < c) or, in the limit as c — oo, this expression goes over into the 
classical p = mv. For v = c, the momentum becomes infinite. 
The time derivative of the momentum is the force acting on the particle. Suppose the 
velocity of the particle changes only in direction, that is, suppose the force is directed 
perpendicular to the velocity. Then 


=. (9.2) 


If the velocity changes only in magnitude, that is, if the force is parallel to the velocity, then 


dp m dv (9.3) 


dt Es v? 3 qr 
I-3 


We see that the ratio of force to acceleration is different in the two cases. 


26 RELATIVISTIC MECHANICS § 9 


The energy & of the particle is defined as the quantityt 
€=p-y-L. 
Substituting the expressions (8.2) and (9.1) for L and p, we find 


mc? 


n 
v 

it. 
E c? 


This very important formula shows, in particular, that in relativistic mechanics the energy 
of a free particle does not go to zero for v = 0, but rather takes on a finite value 
é — mc?. (9.5) 
This quantity is called the rest energy of the particle. 
For small velocities (v/c < 1), we have, expanding (9.4) in series in powers of v/e, 
2 


Ez met 2, 
2 


e= (9.4) 


which, except for the rest energy, is the classical expression for the kinetic energy of a 
particle. 

We emphasize that, although we speak of a "particle", we have nowhere made use of the 
fact that it is "elementary". Thus the formulas are equally applicable to any composite body 
consisting of many particles, where by m we mean the total mass of the body and by v the 
velocity of its motion as a whole. In particular, formula (9.5) is valid for any body which is at 
rest as a whole. Wecall attention to the fact that in relativistic mechanics the energy of a free 
body (i.e. the energy of any closed system) is a completely definite quantity which is always 
positive and is directly related to the mass of the body. In this connection we recall that in 
classical mechanics the energy of a body is defined only to within an arbitrary constant, and 
can be either positive or negative. 

The energy of a body at rest contains, in addition to the rest energies of its constituent 
particles, the kinetic energy of the particles and the energy of their interactions with one 
another. In other words, mc? is not equal to X m, c? (where m, are the masses of the particles), 
and so m is not equal to Z m,. Thus in relativistic mechanics the law of conservation of mass 
does not hold: the mass of a composite body is not equal to the sum of the masses of its 
parts. Instead only the law of conservation of energy, in which the rest energies of the particlcs 
are included, is valid. 

Squaring (9.1) and (9.4) and comparing the results, we get the following relation between 
the energy and momentum of a particle: 

2 
T =p +m’. (9.6) 


The energy expressed in terms of the momentum is called the Hamiltonian function #: 


H — c p +m e. (9.7) 
For low velocities, p « mc, and we have approximately 
2 
= mc? + LEA 
i 2m 


i.e., except for the rest energy we get the familiar classical expression for the Hamiltonian. 
t See Mechanics, § 6. 
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From (9.1) and (9.4) we get the following relation between the energy, momentum, and 
velocity of a free particle: 


For v = c, the momentum and energy of the particle become infinite. This means that a 
particle with mass m different from zero cannot move with the velocity of light. Nevertheless, 
in relativistic mechanics, particles of zero mass moving with the velocity of light can exist. 
From (9.8) we have for such particles: 

p= (9.9) 
The same formula also holds approximately for particles with nonzero mass in the so-called 
ultrarelativistic case, when the particle energy & is large compared to its rest energy mc’. 

We now write all our formulas in four-dimensional form. According to the principle of 
least action, 


b 
ôS = -með Í ds =0. 


To set up the expression for ôS, we note that ds = J dx; dx! and therefore 


b b 
i 
ôS = -mc Í Pn Lae —mc ll u;dôx’. 
ds 
Integrating by parts, we obtain 
lb : du; 
6S = —mcu,6x'| -- mc f 5x! EM ds. (9.10) 


As we know, to get the equations of motion we compare different trajectories between the 
same two points, i.e. at the limits (ôx, = (óx^), = 0. The actual trajectory is then deter- 
mined from the condition 6S = 0. From (9.10) we thus obtain the equations du,/ds = 0; that 
is, a constant velocity for the free particle in four-dimensional form. 

To determine the variation of the action as a function of the coordinates, one must consider 
the point a as fixed, so that (6x'), = 0. The second point is to be considered as variable, but 
only actual trajectories are admissible, i.e., those which satisfy the equations of motion. 
Therefore the integral in expression (9.10) for 5S is zero. In place of (5x'), we may write 
simply óx', and thus obtain 


ôS = — mcujóx. (9.11) 
The four-vector 
oS 
MEM es 9.12 
Pi oxi ( ) 


is called the momentum four-vector. As we know from mechanics, the derivatives ¢S/éx, 
@S/dy, ðS/ðz are the three components of the momentum vector p of the particle, while the 
derivative —0S/dt is the particle energy £. Thus the covariant components of the four- 


t For example, light quanta and neutrinos. 
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momentum are p; = (&/c, — p), while the contravariant components aret 


p' — (él[c, p). (9.13) 
From (9.11) we see that the components of the four-momentum of a free particle are: 
p! = meu’. (9.14) 


Substituting the components of the four-velocity from (7.2), we see that we actually get 
expressions (9.1) and (9.4) for p and @. 

Thus, in relativistic mechanics, momentum and energy are the components of a single 
four-vector. From this we immediately get the formulas for transformation of momentum 
and energy from one inertial system to another. Substituting (9.13) in the general formulas 
(6.1) for transformation of four-vectors, we find: 


y P:-= p, €=——— 5, (9.15) 


where p,, Py, p, are the components of the three-dimensional vector p. 
From the definition (9.14) of the four-momentum, and the identity u'v; = 1, we have, for 
the square of the four-momentum of a free particle: 


pip'= m?c? (9.16) 
Substituting the expressions (9.13), we get back (9.6). 


By analogy with the usual definition of the force, the force four-vector is defined as the 
derivative: 


_ dp! du! 
o eeu i (9.17) 
ds 
Its components satisfy the identity g,u’ = 0. The components of this four-vector are expressed 
in terms of the usual three-dimensional force vector f = dp/dr: 
P f-v f 
d Nr ME 

x n TNT J a 

c? c 
The time component is related to the work done by the force. 


The relativistic Hamilton-Jacobi equation is obtained by substituting the derivatives 
— QS/Ox' for p; in (9.16): 


(9.18) 


êS ôS — „ôS OS — 
dx, Ox .? Ox Ok 
or, writing the sum explicitly: 


| /OSN /0SN /0SN. f/0SN 
ala) -(5) - 6) -() - oe 


+ We call attention to a mnemonic for remembering the definition of the physical four-vectors: the 
contravariant components are related to the corresponding three-dimensional vectors (r for x', p for p') with 
the “right”, positive sign. 


mic, (9.19) 
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The transition to the limiting case of classical mechanics in equation (9.19) is made as 
follows. First of all we must notice that just as in the corresponding transition with (9.7), 
the energy of a particle in relativistic mechanics contains the term mc?, which it does not in 
classical mechanics. Inasmuch as the action S is related to the energy by 4 = —(0S/0t), in 
making the transition to classical mechanics we must in place of S substitute a new action 
S' according to the relation: 

S-S'—mci. 
Substituting this in (9.20), we find 


EEEE- 
2m (NOx . Oy ez 2mc? x) at 


In the limit as c — oo, this equation goes over into the classical Hamilton-Jacobi equation. 


§ 10. Transformation of distribution functions 


In various physical problems we have to deal with distribution functions for the momenta 
of particles: f(p) dp, dp, dp, is the number of particles having momenta with components 
in given intervals dp,, dp,, dp. (or, as we say for brevity, the number of particles in a given 
volume element d?p = dp,dp,dp, in “momentum space"). We are then faced with the problem 
of finding the law of transformation of the distribution function f(p) when we transform 
from one reference system to another. 

To solve this problem, we first determine the properties of the "volume element" 
dp,dp,dp, with respect to Lorentz transformations. If we introduce a four-dimensional 
coordinate system, on whose axes are marked the components of the four-momentum of a 
particle, then dp,dp,dp. can be considered as the zeroth component of an element of the 
hypersurface defined by the equation p'p; = m?c?. The element of hypersurface is a four- 
vector directed along the normal to the hypersurface; in our case the direction of the normal 
obviously coincides with the direction of the four-vector p;. From this it follows that the 
ratio 


4 


dp,dp,dp, 
é 
is an invariant quantity, since it is the ratio of corresponding components of two parallel 


four-vectors.T 
The number of particles, f dp, dp, dp,, is also obviously an invariant, since it does not 


(10.1) 


t The integration with respect to the element (10.1) can be expressed in four-dimensional form by means 
of the ó-function (cf. the footnote on p. 70) as an integration with respect to 


2 spp! — mci)d'p, d'p = dp'dp'dp'dp*. (10.1) 
€ 


The four components p' are treated as independent variables (with p? taking on only positive values). Formula 
(10.1a) is obvious from the following representation of the delta function appearing in it: 


ó(p'p,— m?c?) = (ov — 2 - LE Jj + a(n - I} (10.15) 


where 4 = cv p? Tic. This formula in turn follows from formula (v) of the footnote on p. 70. 
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depend on the choice of reference frame. Writing it in the form 
dp,dp,dp, 
& 
and using the invariance of the ratio (10.1), we conclude that the product f(p)é is invariant. 


Thus the distribution function in the K’ system is related to the distribution function in the 
K system by the formula 


SPE 


fpe 
e’ 
where p and & must be expressed in terms of p' and &" by using the transformation formulas 
(9.15). 
Let us now return to the invariant expression (10.1). If we introduce "spherical co- 
ordinates” in momentum space, the volume element dp, dp, dp, becomes p dp do, where do is 


the element of solid angle around the direction of the vector p. Noting that pdp = £dé[c? 
[from (9.6), we have: 


JP) = (10.2) 


p’dpdo _ pd&do 
é e` 


Thus we find that the expression 


pd&do (10.3) 
is also invariant. 

The notion of a distribution function appears in a different aspect in the kinetic theory 
of gases: the product f(r, p) dp,dp,dp. dV is the number of particles lying in a given volume 
element dV and having momenta in definite intervals dp,, dp,, dp.. The function f(r, p) is 
called the distribution function in phase space (the space of the coordinates and momenta 
of the particle), and the product of differentials dr — d?p dV is the element of volume 
of this space. We shall find the law of transformation of this function. 

In addition to the two reference systems K and K’, we also introduce the frame Ky in 
which the particles with the given momentum are at rest; the proper volume dV, of the 
element occupied by the particles is defined relative to this system. The velocities of the 
systems K and K' relative to the system Kọ coincide, by definition, with the velocities v 
and v' which these particles have in the systems K and K’. Thus, according to (4.6), we have 


p? v? 
dV = dV, |1—-4. dV’ = dV, ]1— 5, 
[4 [4 


from which 
dV _ E 
dV' & 
Multiplying this equation by the equation d?p/d?p' = &/&', we find that 
dt = dv, (10.4) 


i.e. the element of phase volume is invariant. Since the number of particles f dt is also 
invariant, by definition, we conclude that the distribution function in phase space is an 
invariant: 


f(r’, p) = f(r, p), (10.5) 


where r’, p' are related to r, p by the formulas for the Lorentz transformation. 
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§ 11. Decay of particles 


Let us consider the spontaneous decay of a body of mass M into two parts with masses 
m, and m;. The law of conservation of energy in the decay, applied in the system of reference 
in which the body is at rest. gives.t 


M Cut 6. (11.1) 


where & 19 and & 29 are the energies of the emerging particles. Since 6,9 > m, and &,9 > m;, 
the equality (11.1) can be satisfied only if M > m, +m, i.e. a body can disintegrate spon- 
taneously into parts the sum of whose masses is less than the mass of the body. On the other 
hand, if M < m, 4 m;, the body is stable (with respect to the particular decay) and does not 
decay spontaneously. To cause the decay in this case, we would have to supply to the body 
from outside an amount of energy at least equal to its “binding energy" (m, +m, — M). 

Momentum as well as energy must be conserved in the decay process. Since the initial 
momentum of the body was zero, the sum of the momenta of the emerging particles must 
be zero: Pio + P29 = 0. Consequently p79 = p2,, or 


Ela- m? = £3- m3. (11.2) 


The two equations (11.1) and (11.2) uniquely determine the energies of the emerging 
particles: 

M?+m?-m? ? _M?°-mi+m} 
— 2M , 20 7 2M x 


In a certain sense the inverse of this problem is the calculation of the total energy M of 
two colliding particles in the system of reference in which their total momentum is zero. 
(This is abbreviated as the ''system of the center of inertia" or the '*C-system".) The com- 
putation of this quantity gives a criterion for the possible occurrence of various inelastic 
collision processes, accompanied by a change in state of the colliding particles, or the 
"creation" of new particles. A process of this type can occur only if the sum of the masses 
of the “reaction products" does not exceed M. 

Suppose that in the initial reference system (the “laboratory” system) a particle with mass 
m, and energy 6, collides with a particle of mass m; which is at rest. The total energy of the 
two particles is 


PES (11.3) 


€=€,+6, —-é tm; 


and their total momentum is p = p, + p; = pı. Considering the two particles together as a 
single composite system, we find the velocity of its motion as a whole from (9.8): 


p Pi 
PEL zi 11.4 
é 6, Tm; ( ) 
This quantity is the velocity of the C-system with respect to the laboratory system (the L- 


system). 
However, in determining the mass M, there is no need to transform from one reference 


t In $8 11-13 we set c = 1. In other words the velocity of light is taken as the unit of velocity (so that 
the dimensions of length and time become the same). This choice is a natural one in relativistic mechanics 
and greatly simplifies the writing of formulas. However, in this book (which also contains a considerable 
amount of nonrelativistic theory) we shall not usually use this system of units, and will explicitly indicate 
when we do. 

If c has been set equal to unity in formulas, it is easy to convert back to ordinary units: the velocity is 
introduced to assure correct dimensions. 


32 RELATIVISTIC MECHANICS § 1] 


frame to the other. Instead we can make direct use of formula (9.6), which is applicable to 
the composite system just as it is to each particle individually. We thus have 


M? = £? - p? =(&,+m,)’—(é{—m)}), 
from-which 
M? = mi 4 mir2m,4,. (11.5) 


PROBLEMS 


1. A particle moving with velocity V dissociates “in flight" into two particles. Determine the 
relation between the angles of emergence of these particles and their energies. 

Solution: Let £ be the energy of one of the decay particles in the C-system [i.e. 4,9 or £59 in 
(11.3)], £ the energy of this same particle in the L-system, and @ its angle of emergence in the L- 
system (with respect to the direction of V). By using the transformation formulas we find: 


4 — Vp cos 0 
fo SS > S 
Vi-vV? 
so that 
cos 8 $—&V1—V? a 
OS U = —— a., 
VV 62—m? ) 
For the determination of 4 from cos 0 we then get the quadratic equation 

EX1 — V? cos? 0)—28£ V 1 — V2-+ £2(1 — V2) + Vm? cos? 0 = 0, (2) 


which has one positive root (if the velocity vo of the decay particle in the C-system satisfies vg > V) 
or two positive roots (if vo « V). 

The source of this ambiguity is clear from the following graphical construction. According to 
(9.15), the momentum components in the L-system are expressed in terms of quantities referring to 
the C-systeni by the formulas 

Po COS 09-- 69 V . 
Picea Py = Po Sin Oo. 
V1—V? ” 
Eliminating 0o, we get 
pit (prV 1 —V?— EV} = pi. 


(a) Vev (b) V»v, 


Qart A 
L4 


Fic. 3. 


With respect to the variables p,, py, this is the equation of an ellipse with semiaxes po/ V1 — V2, po. 
whose center (the point O in Fig. 3) has been shifted a distance éo V/V 1—- V? from the point 
p =0 (point A in Fig. 3).t 

If V > po/@o = vo, the point A lies outside the ellipse (Fig. 3b), so that for a fixed angle 0 the 
vector p (and consequently the energy 4) can have two different values. It is also clear from the 
construction that in this case the angle 0 cannot exceed a definite value Omax (corresponding to 
the position of the vector p in which it is tangent to the ellipse). The value of Omax is most easily 


T In the classical limit, the ellipse reduces to a circie. (See Mechanics, § 16.) 
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determined analytically from the condition that the discriminant of the quadratic equation (2) 
go to zero: 
. poVl—V? 
Sin Omax = ———7——- 
ma mV 


2. Find the energy distribution of the decay particles in the L-system. 
Solution: In the C-system the decay particles are distributed isotropically in direction, i.e. the 
number of particles within the element of solid angie dog = 27 sin Oo d@p is 


| 
4n 
The energy in the L-system is given in terms of quantities referring to the C-system by 
_ opo V cos 6o 
VV? 


dN =: — dog = $ |d cos Ool. (1) 


é 


and runs through the range of values from 
oo Vpo to Cor Fpo 
v1—yvi V1— V3 
Expressing d|cos 8o) in terms of dé, we obtain the normalized energy distribution (for each of the 


two types of decay particles): 


l 
dN = 5; 


Po 


Vi—V?de. 


3. Determine the range of values in the L-system for the angle between the two decay particles 
(their separation angle) for the case of decay into two identical particles. 

Solution: In the C-system, the particles fly off in opposite directions, so that 6:59 = 1 — 82s = Op. 
According to (5.4), the connection between angles in the C- and L-systems is given by the formulas: 


cot 8 vg COS Oo +V cot 8 — to cos Og +V 
= — 2 = — 2] 
: vo Sin Oo V 1 — V? to Sin Oo V 1— V? 


(since vio = tgo = vg in the present case). The required separation angle is © = 01 —6,, and a 
simple calculation gives: 
V?—2 : V?i$ sin? Oo 


2Vrg9 V 1 — V? sin Oo ` 


An examination of the extreme for this expression gives the following ranges of possible values of 


O: 


cot O = 


for v<wi 2an (F zn) «Ocm; 


f V 2.o [9] inl l-V? £7. 
OT Ug < < V1—v2 < < sin 2 » 


v I-02 V 


4. Find the angular distribution in the L-system for decay particles of zero mass. 
Solution: According to (5.6) the connection between the angles of emergence in the C- and 
L-systems for particles with m — O is 


for (eeu 0 <0 rac evi) <5 


cos Oo = TF 


Substituting this expression in formula (1) of Problem 2, we find: 


(1 — V?)do 


dN = 4n(1 — V cos 0)? 
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5. Find the distribution of separation angles in the L-system for a decay into two particles of 
zero mass. 

Solution: The relation between the angles of emergence, 6;, 02 in the L-system and the angles 
910 = 0o, 929 = n— bo in the C-system is given by (5.6), so that we have for the separation angle 
© = 0, +62: 


and conversely, 


. I-V? 16 
cos = |1- Er eon S. 


Substituting this expression in formula (1) of problem 2, we find: 


1—V? do 


16xV ENTE 


The angle © takes on values from z to Omin = 2cos^! V. 


dN 


6. Determine the maximum energy which can be carried off by one of the decay particles, when 
a particle of mass M at rest decays into three particles with masses 7, m2, and m3. 

Solution: The particle m, has its maximum energy if the system of the other two particles mz and 
ms has the least possible mass; the latter is equal to the sum 7 3-73 (and corresponds to the case 
where the two particles move together with the same velocity). Having thus reduced the problem 
to the decay of a body into two parts, we obtain from (11.3): 


M?-t mi — (ma +m) 
2M ` 


i mar = 


§ 12. Invariant cross-section 


Collision processes are characterized by their invariant cross-sections, which determine 
the number of collisions (of the particular type) occurring between beams of colliding 
particles. 

Suppose that we have two colliding beams; we denote by n, and n, the particle densities 
in them (i.e. the numbers of particles per unit volume) and by v, and v; the velocities of the 
particles. In the reference system in which particle 2 is at rest (or, as one says, in the rest 
frame of particle 2), we are dealing with the collision of the beam of particles 1 with a stationary 
target. Then according to the usual definition of the cross-section c, the number of collisions 
occurring in volume dV in time dt is 


dv = ovn, no dVdt, 


where v,e, is the velocity of particle 1 in the rest system of particle 2 (which is just the definition 
of the relative velocity of two particles in relativistic mechanics). 

The number dv is by its very nature an invariant quantity. Let us try to express it in a form 
which is applicable in any reference system: 


dv = An,n,dVdt, (12.1) 


where A is a number to be determined, for which we know that its value in the rest frame of 
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one of the particles is v,,, 0. We shall always mean by c precisely the cross-section in the 
rest frame of one of the particles, i.e. by definition, an invariant quantity. From its definition, 
the relative velocity v,e is also invariant. 

In the expression (12.1) the product dVdt is an invariant. Therefore the product An, n; 
must also be an invariant. 

The law of transformation of the particle density n is easily found by noting that the 
number of particles in a given volume element dV, ndV, is invariant. Writing ndV = ng dV, 
(the index 0 refers to the rest frame of the particles) and using formula (4.6) for the trans- 
formation of the volume, we find: 


No 
n —————— (12.2) 
Vi-v? 
or n = ng é[m, where & is the energy and m the mass of the particles. 


Thus the statement that Av, n, is invariant is equivalent to the invariance of the expression 
A6 , 65. This condition is more conveniently represented in the form 


$16; — $16; 
PuP2 — €:62—Py P; 
where the denominator is an invariant—the product of the four-momenta of the two 
particles. 
In the rest frame of particle 2, we have £, = m;, p; = 0, so that the invariant quantity 


(12.3) reduces to A. On the other hand, in this frame A = ov,,,. Thus in an arbitrary reference 
system, 


= inv, (12.3) 


(12.4) 


To give this expression its final form, we express v,e in terms of the momenta or velocities 
of the particles in an arbitrary reference frame. To do this we note that in the rest frame of 
particle 2, 


008 m 
PiiP2 Vi-v2, 2° 
Then 
2,2 
mim? 
va=,/ 1- ne (12.5) 
! J (Ppi)? 


Expressing the quantity p,; pi = &,4;—p,' p; in terms of the velocities v, and v; by using 
formulas (9.1) and (9.4): 

m l-vyvy 
* tav) = 08)" 
and substituting in (12.5), after some simple transformations we get the following expression 
for the relative velocity: 


PuPi = m 


2 
= Vv, - vay! = (vi x V2) 
rel 1-v v 


(12.6) 


(we note that this expression is symmetric in v; and v;, i.e. the magnitude of the relative 
velocity is independent of the choice of particle used in defining it). 
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Substituting (12.5) or (12.6) in (12.4) and then in (12.1), we get the final formulas for 
solving our problem: 
_ YOu = mind 


d 
d £,6; 


n,n, dVdt (12.7) 


or 
dv = ox (v, — v) — (v, x v5)? nyn,dVat (12.8) 
(W. Pauli, 1933). 


If the velocities v, and v; are collinear, then v, x v; — 0, so that formula (12.8) takes the 
form: 


dv = olv, — v;|n; n; dVdt. (12.9) 


PROBLEM 


Find the ‘element of length" in relativistic “‘velocity space”. 

Solution: The required line element dl, is the relative velocity of two points with velocities v and 
v -- dv. We therefore find from (12.6) 
(dv? —(vxdvy? | dv? 

a= ^ (1—8y 
where 6, ¢ are the polar angle and azimuth of the direction of v. If in place of v we introduce the 
new variable x through the equation v = tanh x, the line element is expressed as: 
dl? = dy? E sinh? x(d6? -- sin? 8- d¢?). 


From the geometrical point of view this is the line element in three-dimensional Lobachevskii 
space—the space of constant negative curvature (see (111.12)). 


2 
dt = ES i (d6? -- sin? 6- dé?), 


§ 13. Elastic collisions of particles 


Let us consider, from the point of view of relativistic mechanics, the elastic collision of 
particles. We denote the momenta and energies of the two colliding particles (with masses 
m, and mj) by pi, &, and p;, &;; we use primes for the corresponding quantities after 
collision. The laws of conservation of momentum and energy in the collision can be written 
together as the equation for conservation of the four-momentum: 


pit pi = pit pi. (13.1) 
From this four-vector equation we construct invariant relations which will be helpful in 
further computations. To do this we rewrite (13.1) in the form: 


Pit pr—Pi = ph- 
and square both sides (i.e. we write the scalar product of each side with itself). Noting that 


the squares of the four-momenta pj and p} are equal to m?, and the squares of pj and p? 
are equal to m, we get: 


mit pipai—PiPi-— Papi. (13.2) 
Similarly, squaring the equation pi+ pi— p; = pj, we get: 
mj puPi— PiPi- pupi 0. (13.3) 
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Let us consider the collision in a reference system (the L-system) in which one of the 
particles (m) was at rest before the collision. Then p; = 0, £} = m,, and the scalar products 
appearing in (13.2) are: 

PiuiPi = m, 
PrP: = 264, (13.4) 
Pupi = 6,6,—Pi Pi = 6,61—pipi cos O, 
where 8, is the angle of scattering of the incident particle m,. Substituting these expressions 
in (13.2) we get: 


tou sta m) mH n (13.5) 


Similarly, we find from (13.3): 


— (6, mj(8,—m:;) 
Pip? 

where 0, is the angle between the transferred momentum p; and the momentum of the 

incident particle p,. 


The formulas (13.5-6) relate the angles of scattering of the two particles in the L-system 
to the changes in their energy in the collision. Inverting these formulas, we can express the 


energies &;, £; in terms of the angles 0, or 0,. Thus, substituting in (13.6) p, = ./6? — mi, 
pi /(6;y — m3 and squaring both sides, we find after a simple computation: 
(€,+m,) 4 (62 — m2) cos? 0, 
(6, 4 m? -(€1— m?) cos? 6," 


cos 0, ; (13.6) 


(13.7) 


6% = m 


Inversion of formula (13.5) leads in the general case to a very complicated formula for £4 
in terms of 0,. 

We note that if m, > my, i.e. if the incident particle is heavier than the target particle, the 
scattering angle 0, cannot exceed a certain maximum value. It is easy to find by elementary 
computations that this value is given by the equation 


: ms 
sin 0 


(13.8) 


1 max = E 
m, 
which coincides with the familiar classical result. 

Formulas (13.5-6) simplify in the case when the incident particle has zero mass: m, = 0, 
and correspondingly p, = &,, p, = 4. For this case let us write the formula for the energy 
of the incident particle after the collision, expressed in terms of its angle of deflection: 

! m 
= SS (13.9) 
1 — cos 0, +— 
H é, 

Let us now turn once again to the general case of collision of particles of arbitrary mass. 
The collision is most simply treated in the C-system. Designating quantities in this system 
by the additional subscript 0, we have Pio = — P20= po. From the conservation of momen- 
tum, during the collision the momenta of the two particles merely rotate, remaining equal in 
magnitude and opposite in direction. From the conservation of energy, the value of each of 
the momenta remains unchanged. 
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Let y be the angle of scattering in the C-system—the angle through waich the momenta 
Pio and p;o are rotated by the collision. This quantity completely determines the scattering 
process in the C-system, and therefore also in any other reference system. It is also con- 
venient in describing the collision in the L-system and serves as the single parameter which 
remains undetermined after the conservation of momentum and energy are applied. 

We express the final energies of the two particles in the L-system in terms of this para- 
meter. To do this we return to (13.2), but this time write out the product p,;p' in the C- 
system: 

Pip) = £10810 P10" Pio = & 10 — Po COS z = po(1 —cos z)-- mi 
(in the C-system the energies of the particles do not change in the collision: 4,9 = 6&9). 
We write out the other two products in the L-system, i.e. we use (13.4). As a result we get: 
E, —&, = —(pilm;Y1-— cos y). We must still express på in terms of quantities referring to 
the L-system. This is easily done by equating the values of the invariant p,;p; in the L- and 
C-systems: 
610620 —P10'P20 = Ó M2, 


v (po miY(po +m?) = & m2 — po. 


Solving the equation for på, we get: 


or 


2 2 2 
m;(é6j— mi) 


2. 
iss mit mict2m;6, (13.10) 
Thus, we finally have: 
; m;(éi—mi 
éi =é — = (1- y). ; 
! ! m? 4 ml-c-2m;4, Dur) (13.11) 


The energy of the second particle is obtained from the conservation law: £, +m, = 6,+6%4. 
Therefore 

m(@;—m?) 
mi+m2+2m,é, 

The second terms in these formulas represent the energy lost by the first particle and trans- 
ferred to the second particle. The maximum energy transfer occurs for y = z, and is equal to 

2 2 
Fina = Fin aia = ar (13.13) 

The ratio of the minimum kinetic energy of the incident particle after collision to its 

initial energy is: 


4 = m+ (1— cos 7). (13.12) 


D 


Imin TMi 0 (m,—mj* 


&,—-m,  mleémic2m;£, 
In the limiting case of low velocities (when & z m+ mv?/2), this relation tends to a constant 


limit, equal to 
(mm 2 
m +m, ` 


In the opposite limit of large energies &,, relation (13.14) tends to zero; the quantity 6 min 
tends to a constant limit. This limit is 


(13.14) 


mimi 


2m, 
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Let us assume that m; » m,, i.e. the mass of the incident particle is small compared to 
the mass of the particle at rest. According to classical mechanics the light particle could 
transfer only a negligible part of its energy (see Mechanics, $ 17). This is not the case in 
relativistic mechanics. From formula (13.14) we see that for sufficiently large energies &, 
the fraction of the energy transferred can reach the order of unity. For this it is not sufficient 
that the velocity of m, be of order 1, but one must have &, ~ my, i.e. the light particle must 
have an energy of the order of the rest energy of the heavy particle. 

A similar situation occurs for m, « m, i.e. when a heavy particle is incident on a light 
one. Here too, according to classical mechanics, the energy transfer would be insignificant. 
The fraction of the energy transferred begins to be significant only for energies £, ~ m?/my. 
We note that we are not talking simply of velocities of the order of the light velocity, but 
of energies large compared to r1, i.e. we are dealing with the ultrarelativistic case. 


PROBLEMS 


1. The triangle ABC in Fig. 4 is formed by the momentum vector p of the impinging particle 
and the momenta p'i, p’2 of the two particles after the collision. Find the locus of the points C 
corresponding to all possible values of p^, P'a. 

Solution: The required curve is an ellipse whose semiaxes can be found by using the formulas 
obtained in problem 1 of § 11. In fact, the construction given there determined the locus of the 
vectors p in the L-system which are obtained from arbitrarily directed vectors po with given length 
Po in the C-system. 


(a) m,»m, (b) m< ms 


Fic. 4. 


Since the absolute values of the momenta of the colliding particles are identical in the C-system, 
and do not change in the collision, we are dealing with a similar construction for the vector p';. for 
which 

mV 
Po 9 Pio = P20 — ———— 
° VI-V? 


in the C-system where V is the velocity of particle m, in the C-system, coincides in magnitude with 
tne velocity of the center of inertia, and is equal to V = p,/(&; =z) (see (11.4)). As a result we find 
that the minor and major semiaxes of the ellipse are 


pe mp Po _ mp (E, + m2) 
5o mit mitImé,  J1-V° mitmi+ Imé, 
(the first of these is, of course, the same as (13.10)). 
For 8, = 0, the vector p’, coincides with pi, so that the distance AB is equal to p,. Comparing pi 


with the length of the major axis of the ellipse, it is easily shown that the point A lies outside the 
ellipse if mi > mz (Fig. 4a), and inside it if 7 < mz (Fig. 4b). 


2. Deterinine the minimum separation angle Omin of two particles after collision if the masses 
of the two particles are the same (m; = m = m). 
Solution: Vf m, = ms, the point A of the diagram lies on the ellipse, while the minimum separation 
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angle corresponds to the situation where point C is at the end of the minor axis (Fig. 5). From the 
construction it is clear that tan (Omin/2) is the ratio of the lengths of the semiaxes, and we find: 


t Omn /— 2m 
ano 5 E dm 


61,—m 
6,43 


or 


cos Omin = 


^ 
C70 


min 


3. For the collision of two particles of equal mass ən, express &",, &’2, x in terms of the angle 0, 
of scattering in the L-system. 

Solution: inversion of formula (13.5) in this case gives: 
ou (0i m) (61m) cos? 0, 
^ (644m —(6,—n)cos? 0, 


(6,2? —n?)sin? 0, 


éi E = a 
! ;—m* 2m-4 (6;—1) sin? 0, 


Comparing with the expression for £’ in terms of x: 


eV -4- ém 


(1 —cos x), 


we find the angle of scattering in the C-system: 
2m — (£, + 3m) sin? 6, 


OS = Fn + (6, —m) sin’, | 


§ 14. Angular momentum 


As is well known from classical mechanics, for a closed system, in addition to conserva- 
tion of energy and momentum, there is conservation of angular momentum, that is, of the 
vector 

M=) rxp 
where r and p are the radius vector and momentum of the particle; the summation runs over 
all the particles making up the system. The conservation of angular momentum is a con- 
sequence of the fact that because of the isotropy of space, the Lagrangian of a closed system 
does not change under a rotation of the system as a whole. 

By carrying through a similar derivation in four-dimensional form, we obtain the 
relativistic expression for the angular momentum. Let x! be the coordinates of one of the 
particles of the system. We make an infinitesimal rotation in the four-dimensional space. 
Under such a transformation, the coordinates x' take on new values x" such that the 
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differences x — x‘ are linear functions 

x — xi = x, óQ* (14.1) 
with infinitesimal coefficients 6Q,,. The components of the four-tensor 6Q,, are connected 
to one another by the relations resulting from the requirement that, under a rotation, the 
length of the radius vector must remain unchanged, that is. x; x^ = x;x^. Substituting for 
x" from (14.1) and dropping terms quadratic in 6Q,, as infinitesimals of higher order, we 
find 

xix SNQ, = 0. 
This equation must be fulfilled for arbitrary x'. Since x'x^ is a symmetric tensor, 6Q,, must 


be an antisymmetric tensor (the product of a symmetrical and an antisymmetrical tensor is 
clearly identically zero). Thus we find that 


óQ,; = — dx. (14.2) 
The change in the action for an infinitesimal change of coordinates of the initial point a 
and the final point 5 of the trajectory has the form (see 9.11): 


ôS = —Y p'óx,t 


(the summation extends over all the particles of the system). In the case of rotation which 
we are now considering, ôx; = óQ, x*, and so 


ôS = —50, Y p'x* p. 


If we resolve the tensor Xp'x* into symmetric and antisymmetric parts, then the first of 
these when multiplied by an antisymmetric tensor gives identically zero. Therefore, taking 
the antisymmetric part of Zp'x*, we can write the preceding equality in the form 


5S = —óQ,1Y (pio — px). (14.3) 


For a closed system the action, being an invariant, is not changed by a rotation in 4-space. 
This means that the coefficients of 6Q, in (14.3) must vanish: 


2 (p'x* — px’) = Y (p'x* — p* x^), 
Consequently we see that for a closed system the tensor 
M* =} (x'p* — x*p!) (14.4) 


This antisymmetric tensor is called the four-tensor of angular momentum. The space 
components of this tensor are the components of the three-dimensional angular momentum 
vector M = ir x p: 


M?3 X M, —-M? E M,, M? = M.. 


The components M?', M??, M?? form a vector E(tp—é&r/c°). Thus, we can write the 
components of the tensor M * in the form: 


Mik = | Box (i- EJ — m| (14.5) 
(Compare (6.10).) 


Because of the conservation of M‘* for a closed system, we have, in particular, 


ór 
X (»- 2j — const. 
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Since, on the other hand, the total energy X £ is also conserved, this equality can be written 
in the form ye iy 
r tòp 
=- t = const. 
Xe 2g 


(Quantities referring to different particles are taken at the same time f). 
From this we see that the point with the radius vector 


R= uf (14.6) 
moves uniformly with the velocity jy 
cp 
V = L—— 14.7 
ve" (14.7) 


which is none other than the velocity of motion of the system as a whole. [It relates the total 
energy and momentum, according to formula (9.8).] Formula (14.6) gives the relativistic 
definition of the coordinates of the center of inertia of the system. If the velocities of all the 
particles are small compared to c, we can approximately set & z mc? so that (14.6) goes 
over into the usual classical expression 
R- x ma 
ym 

We note that the components of the vector (14.6) do not constitute the space components 
of any four-vector, and therefore under a transformation of reference frame they do not 
transform like the coordinates of a point. Thus we get different points for the center of 
inertia of a given system with respect to different reference frames. 


PROBLEM 


Find the connection between the angular momentum M of a body (system of particles) in the 
reference frame K in which the body moves with velocity V, and its angular momentum M© in 
the frame Ko in which the body is at rest as a whole; in both cases the angular momentum is defined 
with respect to the same point—the center of inertia of the body in the system Ko.ł 

Solution: The K, system moves relative to the K system with velocity V; we choose its direction 
for the x axis. The components of M'* that we want transform according to the formulas (see 
problem 2 in $ 6): 


M2 + y M (0)02 M + E M (03 
M? = E z^ fik € 


y? 7 
Js Js 


Since the origin of coordinates was chosen at the center of inertia of the body (in the Ky system), 
in that system X ér — 0, and since in that system Dp —0, M‘0? = M (993? = 0, Using the con- 
nection between the components of M'* and the vector M, we find for the latter: 


(0) (0) 
BEA ee 


A123 = A623, 


M, = MY, M, = 


t We note that whereas the classical formula for the center of inertia applies equally well to interacting 
and non-interacting particles, formula (14.6) is valid only if we neglect interaction. In relativistic mechanics, 
the definition of the center of inertia of a system of interacting particles requires us to include explicitly the 
momentum and energy of the field produced by the particles. 

1 We remind the reader that although in the system K, (in which Xp = 0) the angular momentum is 
independent of the choice of the point with respect to which it is defined. in the K system (in wnich Ep # 0) 
the angular momentum does depend on this choice (see Mechanics, § 9). 


CHAPTER 3 


CHARGES IN ELECTROMAGNETIC FIELDS 


§ 15. Elementary particles in the theory of relativity 


The interaction of particles can be described with the help of the concept of a field of force. 
Namely, instead of saying that one particle acts on another, we may say that the particle 
creates a field around itself; a certain force then acts on every other particle located in this 
field. In classical mechanics, the field is merely a mode of description of the physical 
phenomenon—the interaction of particles. In the theory of relativity, because of the finite 
velocity of propagation of interactions, the situation is changed fundamentally. The forces 
acting on a particle at a given moment are not determined by the positions at that same 
moment. A change in the position of one of the particles influences other particles only 
after the lapse of a certain time interval. This means that the field itself acquires physical 
reality. We cannot speak of a direct interaction of particles located at a distance from one 
another. Interactions can occur at any one moment only between neighboring points in 
space (contact interaction). Therefore we must speak of the interaction of the one particle 
with the field, and of the subsequent interaction of the field with the second particle. 

We shall consider two types of fields, gravitational and electromagnetic. The study of 
gravitational fields is left to Chapters 10 to 14 and in the other chapters we consider only 
electromagnetic fields. 

Before considering the interactions of particles with the electromagnetic field, we shall 
make some remarks concerning the concept of a “particle” in relativistic mechanics. 

In classical mechanics one can introduce the concept of a rigid body, i.e., a body which is 
not deformable under any conditions. In the theory of relativity it should follow similarly 
that we would consider as rigid those bodies whose dimensions all remain unchanged in the 
reference system in which they are at rest. However, it is easy to see that the theory of 
relativity makes the existence of rigid bodies impossible in general. 

Consider, for example, a circular disk rotating around its axis, and let us assume that it is 
rigid. A reference frame fixed in the disk is clearly not inertial. It is possible, however, to 
introduce for each of the infinitesimal elements of the disk an inertial system in which this 
element would be at rest at the moment; for different elements of the disk, having different 
velocities, these systems will, of course, also be different. Let us consider a series of line 
elements, lying along a particular radius vector. Because of the rigidity of the disk, the 
length of each of these segments (in the corresponding inertial system of reference) will be 
the same as it was when the disk was at rest. This same length would be measured by an 
observer at rest, past whom this radius swings at the given moment, since each of its seg- 
ments is perpendicular to its velocity and consequently a Lorentz contraction does not 
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occur. Therefore the total length of the radius as measured by the observer at rest. being the 
sum of its segments, will be the same as when the disk was at rest. On the other hand, the 
length of each element of the circumference of the disk. passing by the observer at rest at a 
given moment, undergoes a Lorentz contraction, so that the length of the whole circum- 
ference (measured by the observer at rest as the sum of the lengths of its various segments) 
turns out to be smaller than the length of the circumference of the disk at rest. Thus we 
arrive at the result that due to the rotation of the disk, the ratio of circumference to radius 
(as measured by an observer at rest) must change, and not remain equal to 27. The absurdity 
of this result shows that actually the disk cannot be rigid, and that in rotation it must 
necessarily undergo some complex deformation depending on the elastic properties of the 
material of the disk. 

The impossibility of the existence of rigid bodies can be demonstrated in another way. 
Suppose some solid body is set in motion by an external force acting at one of its points. If 
the body were rigid, all of its points would have to be set in motion at the same time as the 
point to which the force is applied; if this were not so the body would be deformed. How- 
ever, the theory of relativity makes this impossible, since the force at the particular point is 
transmitted to the others with a finite velocity, so that all the points cannot begin moving 
simultaneously. 

From this discussion we can draw certain conclusions concerning the treatment of 
"elementary" particles, i.e., particles whose state we assume to be described completely by 
giving its three coordinates and the three components of its velocity as a whole. It is obvious 
that if an elementary particle had finite dimensions, i.e. if it were extended in space, it could 
not be deformable, since the concept of deformability is related to the possibility of in- 
dependent motion of individual parts of the body. But, as we have seen, the theory of 
relativity shows that it is impossible for absolutely rigid bodies to exist. 

Thus we come to the conclusion that in classical (non-quantum) relativistic mechanics, 
we cannot ascribe finite dimensions to particles which we regard as elementary. In other 
words, within the framework of classical theory elementary particles must be treated as 
points.T 


§ 16. Four-potential of a field 


For a particle moving in a given electromagnetic field, the action is made up of two parts: 
the action (8.1) for the free particle, and a term describing the interaction of the particle with 
the field. The latter term must contain quantities characterizing the particle and quantities 
characterizing the field. 


It turns outt that the properties of a particle with respect to interaction with the electro- 
magnetic field are determined by a single parameter—the charge e of the particle, which can 
be either positive or negative (or equal to zero). The properties of the field are characterized 


+ Quantum mechanics makes a fundamental change in this situation, but here again relativity theory 
makes it extremely difficult to introduce anything other than point interactions. 

1 The assertions which follow should be regarded as being, to a certain extent, the consequence of experi- 
mental data. The form of the action for a particle in an electromagnetic field cannot be fixed on the basis of 
general considerations alone (such as, for example, the requirement of relativistic invariance). The latter 
would permit the occurrence in formula (16.1) of terms of the form f A ds, where A is a scalar function. 

To avoid any misunderstanding, we repeat that we are considering classical (and not quantum) theory, and 
therefore do not include effects which are related to the spins of particles. 
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by a four-vector 4,, the four-poiential. whose components are functions of the coordinates 
and time. These quantities appear in the action function in the term 
h 


` 
, 


— A;dx'. 
C. 
a 
where the functions 4; are taken at points on the world line of the particle. The factor I/c 
has been introduced for convenience. It should be pointed out that, so long as we have no 
formulas relating the charge or the potentials with already known quantities, the units for 
measuring these new quantities can be chosen arbitrarily. t 
Thus the action function for a charge in an electromagnetic field has the form 
h 


S= | (= meds aus?) (16.1) 
Li e 
The three space components of the four-vector A‘ form a three-dimensional vector A 
called the rector potential of the field. The time component is called the scalar potential; we 
denote it by 4? = $. Thus 

A! = ($, A). (16.2) 

Therefore the action integral can be written in the form 

b 


S= [ (-meds+£A-dr—epdr). 
c 


v 
a 


Introducing dr/dt — v, and changing to an integration over t, 
S= | (-me Jita vep) at (16.3) 
. c c 


The integrand is just the Lagrangian for a charge in an electromagnetic field: 


t — 


5 ve 
L= —mc* l—--,+-A-v-e@. (16.4) 
c 


c 


This function differs from the Lagrangian for a free particle (8.2) by the terms (e/c) A: v — eó, 
which describe the interaction of the charge with the field. 

The derivative GL /óv is the generalized momentum of the particle; we denote it by P. 
Carrying out the differentiation, we find 


Pa 42a a=pe a. (16.5) 
D € 


Here we have denoted by p the ordinary momentum of the particle, which we shall refer to 
simply as its momentum. 
From the Lagrangian we can find the Hamiltonian function for a particle in a field from 
the general formula 
eL 
XX my: L. 
ov 


+ Concerning the establishment of these units, see § 27, 
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Substituting (16.4), we get 


Pie aah: (16.6) 


However, the Hamiltonian must be expressed not in terms of the velocity, but rather in terms 
of the generalized momentum of the particle. 

From (16.5) and (16.6) it is clear that the relation between J£ —e@ and P—(e/c)A is the 
sa me as the relation between # and p in the absence of the field, i.e. 


= 2 2 
(* :j = m?c?+ (P- A) : (16.7) 

c c 

or else 

e 2 

206 —- (| n?c* ec? (r- - a) +e. (16.8) 
c 
For low velocities, i.e. for classical mechanics, the Lagrangian (16.4) goes over into 
2 
pola apod! (16.9) 
2 c 


In this approximation 


p-mv-P-— : A, 
c 
and we find the following expression for the Hamiltonian: 


1 e V 
3X =— |P- -A) creó. (16.10) 
2m Ç 
Finally we write the Hamilton-Jacobi equation for a particle in an electromagnetic field. 
It is obtained by replacing, in the equation for the Hamiltonian, P by @S/ér, and # by 
—(0S/ét). Thus we get from (16.7) 


2 J eS 2 
(vs- 54) - 3 (FF +e) tmc = 0. (16.11) 


§ 17, Equations of motion of a charge in a field 


A charge located in a field not only is subjected to a force exerted by the field, but also in 
turn acts on the field, changing it. However, if the charge e is not large, the action of the 
charge on the field can be neglected. In this case, when considering the motion of the charge 
in a given field, we may assume that the field itself does not depend on the coordinates or the 
velocity of the charge. The precise conditions which the charge must fulfil in order to be 
considered as small in the present sense, will be clarified later on (see § 75). In what follows 
we shall assume that this condition is fulfilled. 

So we must find the equations of motion of a charge in a given electromagnetic field. 
These equations are obtained by varying the action, i.e. they are given by the Lagrange 


§ 17 EQUATIONS OF MOTION OF A CHARGE IN A FIELD 47 


(F _ OL TEN 
dtNOÓv] ér’ Son 
where L is given by formula (16.4). 


The derivative OL /óv is the generalized momentum of the particle (16.5). Further, we 
write 


equations 


oL e 
2 zgra v—e grad $ 


But from a formula of vector analysis. 
grad (a: b) = (a: V)b-- (b: V)a +b x curl a +a x curl b, 
where a and b are two arbitrary vectors. Applying this formula to A v, and remembering 
that differentiation with respect to r is carried out for constant v, we find 
OL e 


Q c 


(v: VJA 4- * v x curl A — e grad $. 
g 


So the Lagrange equation has the form: 


d e e e 
—{ p+- = - (Y: V)A + - vxcurl A—e grad ¢. 
dt c c c 

But the total differential (dA/dt)dt consists of two parts: the change (CA/ér)dr of the vector 
potential with time at a fixed point in space, and the change due to motion from one point 


in space to another at distance dr. This second part is equal to (dr: V)A. Thus 
A +(yv: V)A 

—=-— + (v: : 
et 


Substituting this in the previous equation. we find 


d e cA e 
"P L8 Legrad ó- * vxcurl A. (17.2) 
c et c 


This is the equation of motion of a particle in an electromagnetic field. On the left side 
stands the derivative of the particle's momentum with respect to the time. Therefore the 
expression on the right of (17.2) is the force exerted on the charge in an electromagnetic 
field. We see that this force consists of two parts. The first part (first and second terms on the 
right side of 17.2) does not depend on the velocity of the particle. The second part (third 
term) depends on the velocity. being proportional to the velocity and perpendicular to it. 

The force of the first type, per unit charge, is called the e/ectric field intensity; we denote 
it by E. So by definition, 


E= 25 on ees $. (17.3) 


The factor of v/c in the force of the second type, per unit charge, is called the magnetic 
field intensity. We designate it by H. So by definition, 

H — curl A. (17.4) 

If in an electromagnetic field, E z 0 but H =0, then we speak of an electric field; if 

E = 0 but H £0, then the field is said to be magnetic. In general, the electromagnetic field is 


a superposition of electric and magnetic fields. 
We note that E is a polar vector while H is an axial vector. 
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The equation of motion of a charge in an electromagnetic field can now be written as 
dP _ ogy vx (17.5) 
dt c 
The expression on the right is called the Lorentz force. The first term (the force which the 
electric field exerts on the charge) does not depend on the velocity of the charge, and is 
along the direction of E. The second part (the force exerted by the magnetic field on the 
charge) is proportional to the velocity of the charge and is directed perpendicular to the 
velocity and to the magnetic field H. 
For velocities small compared with the velocity of light, the momentum p is approximately 
equal to its classical expression mv, and the equation of motion (17.5) becomes 
d e 
OY = eE4  vxH, (17.6) 
dt c 
Next we derive the equation for the rate of change of the kinetic energy of the particlet 
with time, i.e. the derivative 


It is easy to check that 


dé xin v dp 
dt dt 
Substituting dp/dt from (17.5) and noting that vx H- v = 0, we have 
dé; 
Ua = eE v. (17.7) 


The rate of change of the kinetic energy is the work done by the field on the particle per 
unit time. From (17.7) we see that this work is equal to the product of the velocity by the 
force which the electric field exerts on the charge. The work done by the field during a time 
dt, i.e. during a displacement of the charge by dr, is clearly equal to eE- dr. 

We emphasize the fact that work is done on the charge only by the electric field; the mag- 
netic field does no work on a charge moving in it. This is connected with the fact that the 
force which the magnetic field exerts on a charge is always perpendicular to the velocity of 
the charge. 

The equations of mechanics are invariant with respect to a change in sign of the time, that 
is, with respect to interchange of future and past. In other words, in mechanics the two time 
directions are equivalent. This means that if a certain motion is possible according to the 
equations of mechanics, then the reverse motion is also possible, in which the system passes 
through the same states in reverse order. 

It is easy to see that this is also valid for the electromagnetic field in the theory of relativity. 
In this case, however, in addition to changing t into — /, we must reverse the sign of the mag- 
netic field. In fact it is easy to see that the equations of motion (17.5) are not altered if we 
make the changes 

t> —t, EE, H > —H. (17.8) 


t By "kinetic" we mean the energy (9.4), which includes the rest energy. 
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According to (17.3) and (17.4), this does not change the scalar potential, while the vector 
potential changes sign: 
$—0, | A—-—A, (17.9) 
Thus, if a certain motion is possible in an electromagnetic field, then the reversed motion 
is possible in a field in which the direction of H is reversed. 


PROBLEM 


Express the acceleration of a particle in terms of its velocity and the electric and magnetic field 
intensities. 

Solution: Substitute in the equation of motion (17.5) p = v£,u/c?, and take the expression for 
dêwn/dt from (17.7). As a result, we get 


gem MESSER 5). 
m 


$ 18. Gauge invariance 


Let us consider to what extent the potentials are uniquely determined. First of all we call 
attention to the fact that the field is characterized by the effect which it produces on the 
motion of a charge located in it. But in the equation of motion (17.5) there appear not the 
potentials, but the field intensities E and H. Therefore two fields are physically identical if 
they are characterized by the same vectors E and H. 

If we are given potentials A and @, then these uniquely determine (according to (17.3) and 
(17.4)) the fields E and H. However, to one and the same field there can correspond different 
potentials. To show this, let us add to each component of the potential the quantity — @f/dx*, 
where f is an arbitrary function of the coordinates and the time. Then the potential A, goes 
over into 


E (18.1) 


As a result of this change there appears in the action integral (16.1) the additional term 


e of 
PL -4(t i} (18.2) 
which is a total differential and has no effect on the equations of motion. (See Mechanics, 
§ 2.) 
If in place of the four-potential we introduce the scalar and vector potentials, and in place 
of x', the coordinates ct, x. y, z, then the four equations (18.1) can be written in the form 
; 1 of 
A’ = A « grad f. ó-ó—--. (18.3) 
c et 
It is easy to check that electric and magnetic fields determined from equations (17.3) and 
(17.4) actually do not change upon replacement of A and @¢ by A’ and @’, defined by (18.3). 
Thus the transformation of potentials (18.1) does not change the fields. The potentials are 
therefore not uniquely defined; the vector potential is determined to within the gradient of 
an arbitrary function. and the scalar potential to within the time derivative of the same 
function. 
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In particular, we see that we can add an arbitrary constant vector to the vector potential, 
and an arbitrary constant to the scalar potential. This is also clear directly from the fact that 
the definitions of E and H contain only derivatives of A and $, and therefore the addition of 
constants to the latter does not affect the field intensities. 

Only those quantities have physical meaning which are invariant with respect to the trans- 
formation (18.3) of the potentials; in particular all equations must be invariant under this 
transformation. This invariance is called gauge invariance (in German, eichinvarianz).4 

This nonuniqueness of the potentials gives us the possibility of choosing them so that they 
fulfill one auxiliary condition chosen by us. We emphasize that we can set one condition, 
since we may choose the function fin (18.3) arbitrarily. In particular, it is always possible to 
choose the potentials so that the scalar potential $ is zero. If the vector potential is not zero, 
then it is not generally possible to make it zero, since the condition A = 0 represents three 
auxiliary conditions (for the three components of A). 


§ 19. Constant electromagnetic field 


By a constant electromagnetic field we mean a field which does not depend on the time. 
Clearly the potentials of a constant field can be chosen so that they are functions only of the 
coordinates and not of the time. A constant magnetic field is equal, as before. to H = curl A. 
A constant electric field is equal to 


= —grad $. (19.1) 


Thus a constant electric field is determined only by the scalar potential and a constant 
magnetic field only by the vector potential. 

We saw in the preceding section that the potentials are not uniquely determined. However, 
it is easy to convince oneself that if we describe the constant electromagnetic field in terms of 
potentials which do not depend on the time, then we can add to the scalar potential. without 
changing the fields, only an arbitrary constant (not depending on either the coordinates or 
the time). Usually $ is subjected to the additional requirement that it have a definite value 
at some particular point in space; most frequently $ is chosen to be zero at iniinitv. Thus the 
arbitrary constant previously mentioned is determined. and the scalar potential of the con- 
stant field is thus determined uniquely. 

On the other hand, just as before, the vector potential is not uniquely determined even 
for the constant electromagnetic field: namclv, we can add to it the gradient of an arbitrary 
function of the coordinates. 

We now determine the energy of a charge in a constant electromagnetic field. If the field 
is constant, then the Lagrangian for the charge also does not depend explicitly on the time. 
As we know, in this case the energy is conserved and coincides with the Hamiltonian. 

According to (16.6), we have 


v? 
Ji- zi 


T We emphasize that this is related to the assumed constancy of e in (18.2). Thus the gauge invariance of 
the equations of electrodynamics (see below) and the conservation of charge are closely related to onc 
another. 
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Thus the presence of the field adds to the energy of the particle the term ed, the potential 
energy of the charge in the field. We note the important fact that the energy depends only on 
the scalar and not on the vector potential. This means that the magnetic field does not affect 
the energy of the charge. Only the electric field can change the energy of the particle. This is 
related to the fact that the magnetic field, unlike the electric field, does no work on the charge. 

If the field intensities are the same at all points in space, then the field is said to be uniform. 
The scalar potential of a uniform electric field can be expresscd in terms of the field intensity 
as 

$ - —E':r. (19.3) 
In fact, since E = const, V(E-r) = (E-V)r = E. 

The vector potential of a uniform magnetic field can be expressed in terms of its field 

intensity as 
A= AH xr. (19.4) 
In fact, recalling that H — const, we obtain with the aid of well-known formulas of vector 
analysis: 
curl (H xr) = H div r—(H- V)r = 2H 
(noting that div r = 3). 
The vector potential of a uniform magnetic field can also be chosen in the form 

A, = — Hy, A,=A,=0 (19.5) 
(the z axis is along the direction of H). It is easily verified that with this choice for A we 
have H = curl A. In accordance with the transformation formulas (18.3), the potentials 
(19.4) and (19.5) differ from one another by the gradient of some function: formula (19.5) 
is obtained from (19.4) by adding Vf, where f= —xyH/2. 


PROBLEM 


Give the variational principle for the trajectory of a particle (Maupertuis’ principle) in a ccnstant 
electromagnetic field in relativistic mechanics. 

Solution: Maupertuis’ principle consists in the statement that if the energy of a particle is con- 
served (motion in a constant field), then its trajectory can be determined from the variational 
equation 


ó | P-dr=0, 
where P is the generalized momentum of the particle, expressed in terms of the energy and the 


coordinate differentials, and the integral is taken along the trajectory of the particle.T Substituting 
P = p+(e/c)A and noting that the directions of p and dr coincide, we have 


ó [ (pare) =0, 


- 


where dl = V dt? is the element of arc. Determining p from 


pac = Ga 


[4 


we obtain finally 


ó i (JS) -reas : Ad =0. 
c c 


t See Mechanics, § 44. 
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§ 20. Motion in a constant uniform electric field 


Let us consider the motion of a charge e in a uniform constant electric field E. We take 
the direction of the field as the X axis. The motion will obviously proceed in a plane, which 
we choose as the X Y plane. Then the equations of motion (17.5) become 


D, = eE, p,=0 
(where the dot denotes differentiation with respect to f), so that 
py=eEt, — p,- po. (20.1) 


The time reference point has been chosen at the moment when p, = 0; po is the momentum 
of the particle at that moment. 
The kinetic energy of the particle (the energy omitting the potential energy in the field) is 


fus cv mic? + p?. Substituting (20.1), we find in our case 


Ein = Vm ct 4 cH pà +(ceEt)? = V 62  (ceEty,, (20.2) 
where 4, is the energy at ¢ = 0. 
According to (9.8) the velocity of the particle is v = pc?/&,;,. For the velocity v, = x 
we have therefore 
dx p, — c'eEt 
dt Ekin — \'82 +(ceEt)? 
Integrating, we find 


imd /&à - (ceEtf. (20.3) 
eE 


The constant of integration we set equal to zero.t 
For determining y, we have 


dy p,c Po? 
dt Ekin VEZEN? 
from which 
y = P5 sinh"! (>) (20.4) 
^ eE bo S : 


We obtain the equation of the trajectory by expressing ¢ in terms of y from (20.4) and sub- 
stituting in (20.3). This gives: 


x = — cosh —. (20.5) 


Thus in a uniform electric field a charge moves along a catenary curve. 
If the velocity of the particle is v < c, then we can set pg = mo, o = mc’, and expand 
(20.5) in series in powers of 1/c. Then we get, to within terms of higher order, 


x= y? +const, 


eE 
2mvi 
that is, the charge moves along a parabola, a result well known from classical mechanics. 


t This result (for po = 0) coincides with the solution of the problem of relativistic motion with constant 
“proper acceleration" wo = eE/m (see the problem in § 7). For the present case, the constancy of the accelera- 
tion is related to the fact that the electric field does not change for Lorentz transformations having velocities 
V along the direction of the field (see § 24). 
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§ 21. Motion in a constant uniform magnetic field 


We now consider the motion of a charge e in a uniform magnetic field H. We choose the 
direction of the field as the Z axis. We rewrite the equation of motion 


e 
p—-vxH 
p zt 


in another form, by substituting for the momentum, from (9.8), 


ev 


PB: 


where & is the energy of the particle, which is constant in the magnetic field. The equation of 
motion then goes over into the form 


Edy e 
—— —-vxH 21.1 
edt c Gh) 
or, expressed in terms of components, 
Ùx = OV, b, = —@v,, ù, = 0, (21.2) 
where we have introduced the notation 
ecH 
ee 21.3 
o=- (21.3) 


We multiply the second equation of (21.2) by i, and add it to the first: 


d 
di (v, +iv,) = — iov, +iv,), 


so that 
v, tiv, = ae", 
where a is a complex constant. This can be written in the form a = vg,e7 where vo, and « 
are real. Then 
vs + ivy = vg, e "9*9 

and, separating real and imaginary parts, we find 

V, = Uo, COS (wt +a), Vy = — Uo, Sin (wt +a). (21.4) 
The constants vp, and « are determined by the initial conditions; « is the initial phase, and 
as for Von from (21.4) it is clear that 


21.2 
Uo, = N Ux - U, 


that is, vo, is the velocity of the particle in the X Y plane, and stays constant throughout the 
motion. 
From (21.4) we find, integrating once more, 


x = xo4- r sin (ot +a), y = yo+ r cos (ot +a), (21.5) 
where 
Vor Vof — Cp, 
we Ot 70b Ed 21.6 
s o eH eH ( ) 


(p, is the projection of the momentum on the X Y plane). From the third equation of (21.2), 
we find v, = vp, and 
Z = Zot Ug,t. (21.7) 
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From (21.5) and (21.7), it is clear that the charge moves in a uniform magnetic field along 
a helix having its axis along the direction of the magnetic field and with a radius r given by 
(21.6). The velocity of the particle is constant. In the special case where vo, = 0, that is, the 
charge has no velocity component along the field, it moves along a circle in the plane 
perpendicular to the field. 

The quantity w, as we see from the formulas, is the angular frequency of rotation of the 
particle in the plane perpendicular to the field. 

If the velocity of the particle is low, then we can approximately set & = mc?. Then the 
frequency w is changed to 

eH 
^ mc 


w (21.8) 


We shall now assume that the magnetic field remains uniform but varies slowly in 
magnitude and direction. Let us see how the motion of a charged particle changes in this 
case. 

We know that when the conditions of the motion are changed slowly, certain quantities 
called adiabatic invariants remain constant. Since the motion in the plane perpendicular to 
the magnetic field is periodic, the adiabatic invariant is the integral 


1 
1-5: Prd, 


taken over a complete period of the motion, i.e. over the circumference of a circle in the 
present case (P, is the projection of the generalized momentum on the plane perpendicular 
to Ht). Substituting P, = p,+(e/c)A, we have: 


1 1 e 
I= -dr = — -dr + — -dr. 
5, $5 r T Zr dr 


In the first term we note that p, is constant in magnitude and directed along dr; we apply 
Stokes’ theorem to the second term and write curl A = H:* 


e 

T= 1, x P" — 2eH' 
From this we see that, for slow variation of H, the tangential momentum p, varies propor- 
tionally to VH. 

This result can also be applied to another case, when the particle moves along a helical 
path in a magnetic field that is not strictly homogeneous (so that the field varies little over 
distances comparable with the radius and step of the helix). Such a motion can be considered 
as a motion in a circular orbit that shifts in the course of time, while relative to the orbit the 


(21.9) 


t See Mechanics, § 49. In general the integrals $ p dq, taken over a period of the particular coordinate 
q, are adiabatic invariants. In the present case the periods for the two coordinates in the plane perpendicular 
to H coincide, and the integral Z which we have written is the sum of the two corresponding adiabatic in- 
variants. However, each of these invariants individually has no special significance, since it depends on the 
(non-unique) choice of the vector potential of the field. The nonuniqueness of the adiabatic invariants which 
results from this is a reflection of the fact that, when we regard the magnetic field as uniform over all of space, 
we cannot in principle determine the electric field which results from changes in H, since it will actually 
depend on the specific conditions at infinity. 

*By inspecting the direction of motion of a charge along the orbit for a given direction of H, we observe that it 
is counterclockwise if we look along H. Hence the negative sign in the second term. 
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field appears to change in time but remain uniform. One can then state that the component 
of the momentum transverse to the direction of the field varies according to the law: 
p VCH, where C is a constant and H isa given function of the coordinates. On the other 
hand, just as for the motion in any constant magnetic field, the energy of the particle (and 
consequently the square of its momentum p?) remains constant. Therefore the longitudinal 
component of the momentum varies according to the formula: 


pb = p -pè = p?-CH(x, y, 2). (21.10) 


Since we should always have p? 2 0. we see that penetration of the particle into regions of 
sufficiently high field (CH > p^) is impossible. During motion in the direction of increasing 
field, the radius of the helical trajectory decreases proportionally to p,/H (i.e. proportionally 
to IA H), and the step proportionally to p,. On reaching the boundary where p, vanishes, 
the particle is reflected: while continuing to rotate in the same direction it begins to move 
opposite to the gradient of the field. 

Inhomogeneity of the field also leads to another phenomenon—a slow transverse shift 
(drift) of the guiding center of the helical trajectory of the particle (the name given to the 
center of the circular orbit); problem 3 of the next section deals with this question. 


PROBLEM 


Determine the frequency of vibration of a charged spatial oscillator, placed in a constant, 
uniform magnetic field; the proper frequency of vibration of the oscillator (in the absence of the 
field) is «o. 

Solution: The equations of forced vibration of the oscillator in a magnetic field (directed along 
the z axis) are: 


"E eH . u . . > 
XOX = — D, JrOy—-——Xx Zr wz = 0. 
m 


Multiplying the second equation by í and combining with the first, we find 


H, 
Etol = 12, 
mc 


where č =x +iy. From this we find that the frequency of vibration of the oscillator in a plane 


perpendicular to the field is 
o= Jop! [EV H 
un "'4 mc] — 2mc 


If the field H is weak, this formula goes over into 
o = OoteH/2ine. 


The vibration along the direction of the field remains unchanged. 


§ 22. Motion of a charge in constant uniform electric and magnetic fields 


Finally we consider the motion of a charge in the case where there are present both 
electric and magnetic fields, constant and uniform. We limit ourselves to the case where the 
velocity of the charge v < c, so that its momentum p = nv; as we shall sec later, it is necessary 
for this that the electric field be small compared to the magnetic. 
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We choose the direction of H as the Z axis, and the plane passing through H and E as the 
YZ plane. Then the equation of motion 


e 
mi = eE+ -vxH 
c 
can be written in the form 
e 
mš=“ H,  mj-eE,--3H, mž=eĒ,. (22.1) 
c c 


From the third equation we see that the charge moves with uniform acceleration in the Z 
direction, that is, 


eE. 
z= x tot. (22.2) 
2m 


Multiplying the second equation of (22.1) by i and combining with the first, we find 
d 
petit iei) = iE E, 


(w = eH/mc). The integral of this equation, where X -- iy is considered as the unknown, is 
equal to the sum of the integral of the same equation without the right-hand term and a 
particular integral of the equation with the right-hand term. The first of these is ae ^'^, the 
second is eE,/mo = cE,/H. Thus 
; cE 
X+ip = ae "X —*. 

The constant a is in general complex. Writing it in the form a = be^, with real b and a, we 
see that since a is multiplied by e^'*', we can, by a suitable choice of the time origin, give 
the phase « any arbitrary value. We choose this so that a is real. Then breaking up x +i) 
into real and imaginary parts, we find 


cE . 
X-acos wt + T ý= —asin ot. (22.3) 


At t = O the velocity is along the X axis. 
We see that the components of the velocity of the particle are periodic functions of the 
time. Their average values are: 


This average velocity of motion of a charge in crossed electric and magnetic fields is often 
called the electrical drift velocity. Its direction is perpendicular to both fields and independent 
of the sign of the charge. It can be written in vector form as: 


. (22.4) 


All the formulas of this section assume that the velocity of the particle is small compared 
with the velocity of light; we see that for this to be so, it is necessary in particular that the 
electric and magnetic fields satisfy the condition 


2 « 1, (22.5) 


while the absolute magnitudes of E, and H can be arbitrary. 
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y 


— x 


Fis. 6. 


Integrating equation (22.3) again, and choosing the constant of integration so that at 
t=0, x = y= 0, we obtain 
cE a 
x= z sin wt + —t; y = — (cos wt—1). (22.6) 
@ H @ 


Considered as parametric equations of a curve, these equations define a trochoid. Depend- 
ing on whether a is larger or smaller in absolute value than the quantity cE,/H, the projection 
of the trajectory on the plane X Y has the forms shown in Figs. 6a and 6b, respectively. 

If a= —cE,/ H, then (22.6) becomes 


cE 
y . 
x = — (wt —sin ot), 
oH ) 


E 
y= x (1— cos ot) (22.7) 


that is, the projection of the trajectory on the X Y plane is a cycloid (Fig. 6c). 


PROBLEMS 


1. Determine the relativistic motion of a charge in parallel uniform electric and magnetic fields. 
Solution: The magnetic field has no influence on the motion along the common direction of E 
and H (the z axis), which therefore occurs under the influence of the electric field alone; therefore 
according to $ 20 we find: 
E 
= Cum = V Sad (CEL). 


Z——— 
eE’ 


For the motion in the xy plane we have the equation 


e . e 
B; = z Hoy, by = — 7 Hv. 
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or 
ieHc 
Ekin 


d . cH, |. ; 
à Pt +ipy) Ss — Ws tity) UA (p. ip,). 


Consequently 
P:+ipy = pe~, 


where p, is the constant value of the projection of the momentum on the xy plane, and the auxiliary 
quantity ¢ is defined by the relation 


dt 
dọ —eHc —, 
6 s C kin 
from which 
A E 
= —. sinh — ¢. l 
ct zE sinh n? (1) 


Furthermore we have: 


-i$ | "X D 
Px ipy = pe = -7 (& +p) = —— dé 
so that 
x= S sin g, y = 27 cos é. Q) 
Formulas (1), (2) together with the formula 
Eo E 
z= zg cosh "i2 Q) 


determine the motion of the particle in parametric form. The trajectory is a helix with radius 
cp,/eH and monotonically increasing step, along which the particle moves with decreasing angular 
velocity ¢ = eHc/é x1, and with a velocity along the z axis which tends toward the value c. 


2. Determine the relativistic motion of a charge in electric and magnetic fields which are mutually 
perpendicular and equal in magnitude.t 
Solution: Choosing the z axis along H and the y axis along E and setting E — H, we write the 


equations of motion: 

dp. e dp, Uv; dp; 

oP: _© Ey Seb (ie = Bd 
di c» di ( jl s 


and, as a consequence of them, formula (17.7), 
db yin 
dt 


= eEv,. 
From these equations we have: 
Pz = const, Ekm — CP: = Const = a. 
Also using the equation 
Chin — Cp? = (Erin HCP) Ein — CPx) = c?p? -+ e? 
(where ¢? = m?c* -- c?p? = const), we find: 


l 
Eint Cpa = Lp) +e), 


and so 
a cep? +e? 
E =- v 
kin 2 T 2a , 
L8 epi +e 
Ps 2c 2ac 


t The problem of motion in mutually perpendicular fields E and H which are not equal in magnitude can, 
by a suitable transformation of the reference system, be reduced to the problem of motion in a pure electric 
or a pure magnetic field (see $ 25). 
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Furthermore, we write 


d 4 ust. 
Ekin T -eE (^. ten = CE( kin — CPs) = eEa, 


from which 
5 e? c? 
2eEt =( t+ ai p, 32? Py. (I) 
To determine the trajectory, we make a transformation of variables in the equations 
dx — cp, 
dt kin 
to the variable p, by using the relation dt = 4,4, dp,/eEx, after which integration gives the formulas: 
c e? cà 
= — | =l- — —— p? 
* xí E 7) Pyt 6x2eE @) 
y _ c 2 Ps c? 


Formulas (1) and (2) completely determine the motion of the particle in parametric form (parameter 
Py). We call attention to the fact that the velocity increases most rapidly in the direction per- 
pendicular to E and H (the x axis). 


3. Determine the velocity of drift of the guiding center of the orbit of a nonrelativistic charged 
particle in a quasihomogeneous magnetic field (H. Alfven, 1940). 

Solution: We assume first that the particle is moving in a circular orbit, i.e. its velocity has no 
longitudinal component (along the field). We write the equation of the trajectory in the form 
r — R(r)--C(r), where R(z) is the radius vector of the guiding center (a slowly varying function of 
the time), while G(f) is a rapidly oscillating quantity describing the rotational motion about the 
guiding center. We average the force (e/c)t x H(r) acting on the particle over a period of the oscil- 
latory (circular) motion (compare Mechanics, $ 30). We expand the function H(r) in this expression 
in powers of ¢: 

H(r) = HIR) - (56; V)H(R). 
On averaging, the terms of first order in ¢(1) vanish, while the second-degree terms give rise to 
an additional force 


f- ix ( DH. 
For a circular orbit 
=e avs 
t = o% xn, [4 "E 


where n is a unit vector along H; the frequency c — eH/[mc; v , is the velocity of the particle in its 
circular motion. The average values of products of components of the vector C. rotating in a plane 
(the plane perpendicular to n), are: 


Cabs = Mu, 
where ó,; is the unit tensor in this plane. As a result we find: 


Because of the equations div H = 0 and curl H =0 which the constant field H(R) satisfies, we 
have: 


(nx J) x H= -ndivH + (nr V)H +n x (V x H) = (n )H = H(n: V)n + n(n: VH). 
We are interested in the force transverse to n, giving rise to a shift of the orbit; it is equal to 


mz? mv? 
f= — z^ Vms V 


where p is the radius of curvature of the force line of the field at the given point, and v is a unit 
vector directed from the center of curvature to this point. 
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The case where the particle also has a longitudinal velocity v, ( along n) reduces to the previous case 
if we go over to a reference frame which is rotating about the instantaneous center of curvature of 
the force line (which is the trajectory of the guiding center) with angular velocity v;/p. In this 
reference system the particle has no longitudinal velocity, but there is an additional transverse force, 
the centrifugal force mv?/p. Thus the total transverse force is 


m v7 
fi= M" (+5). 


This force is equivalent to a constant electric field of strength f,/e. According to (22.4) it 
causes a drift of the guiding center of the orbit with a velocity 


1 v? 
Va at + 1) vxn. 


The sign of this velocity depends on the sign of the charge. 


§ 23. The electromagnetic ficld tensor 


In $ 17, we derived the equation of motion of a charge in a field, starting from the 
Lagrangian (16.4) written in three-dimensional form. We now derive the same equation 
directly from the action (16.1) written in four-dimensional notation. 

The principle of least action states 


b 
as =ô [ (-meas-* Aa?) = 0. (23.1) 
Noting that ds = V dx; dx', we find (the limits of integration a and b are omitted for brevity): 
“dbx! ; 
áóS- — Í (me dxdóx e dox taadi) =0. 
ds c c 


We integrate the first two terms in the integrand by parts. Also, in the first term we set 
dx;/ds = uj, where u; are the components of the four-velocity. Then 


Í (me du; óxi + : óx! dA;— 34, 2) — (meu. : A) öx =0. (23.2) 


The second term in this equation is zero, since the integral is varied with fixed coordinate 
values at the limits. Furthermore: 


0A; 0A; 
6A; = ax* óx*, dA; = ox dx", 
and therefore 
i @0A;. i e ðA; , ; 
Í (medu, ôx tae ôx dxt— A dx 2) =0. 


In the first term we write du; = (du,/ds)ds, in the second and third, dx’ = u‘ds. In addition, 
in the third term we interchange the indices i and k (this changes nothing since the indices i 
and k are summed over). Then 


du, ef0A, OAM , "M 
f [me ds aa) w^] atas = 0. 
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In view of the arbitrariness of 5x’, it follows that the integrand is zero, that is, 


du; e (3 22) k 
= u. 


ds c\ðx?  óx' 


nic 


We now introduce the notion 
Fy = ok i (23.3) 


The antisymmetric tensor Fj is called the electromagnetic field tensor. The equation of 
motion then takes the form: 
dui e, 
—=- FÉ. 3.4 
meg =F (23.4) 
These are the equations of motion of a charge in four-dimensional form. 
The meaning of the individual components of the tensor F;, is easily seen by substituting 
the values A; = ($, —A) in the definition (23.3). The result can be written as a matrix in 
which the index i = 0, I, 2, 3 labels the rows, and the index & the columns: 


0 E, E, E, 0 -E, -E, —E, 
-E 0 —H H , E 0 -H H 
Fy = x z »L F*=| 7 7 *L. (23.5 
* |E, H, 0 —H, E, H, 0 —H, (23.5) 
-E, -H, H, 0 E, -H, H, 0; 


More briefly, we can write (see § 6): 
Fi = (E, H), F* = (—E, H). 

Thus the components of the electric and magnetic field strengths are components of the 
same electromagnetic field four-tensor. 

Changing to three-dimensional notation, it is easy to verify that the three space com- 
ponents (i = 1, 2, 3) of (23.4) are identical with the vector equation of motion (17.5), while 
the time component (i — 0) gives the work equation (17.7). The latter 1s a consequence of 
the equations of motion; the fact that only three of the four equations are independent can 
also easily be found directly by multiplying both sides of (23.4) by u’. Then the left side of the 
equation vanishes because of the orthogonality of the four-vectors v and du,/ds, while the 
right side vanishes because of the antisymmetry of Fix. 

If we admit only possible trajectories when we vary S, the first term in (23.2) vanishes 
identically. Then the second term, in which the upper limit is considered as variable, gives the 
differential of the action as a function of the coordinates. Thus 


S = — (meu £ D) ôx. (23.6) 
c 
Then 
S e e 
— Za = meut - A; = pit - A;. (23.7) 
Ox c c 


The four-vector —@S/@x' is the four-vector P; of the generalized momentum of the particle. 
Substituting the values of the components p; and A,, we find that 


"7 
pi = (Bette, p+ fA). (23.8) 


As expected, the space components of the four-vector form the three-dimensional general- 
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ized momentum vector (16.5), while the time component is &/c, where & is the total energy 
of the charge in the field. 


§ 24. Lorentz transformation of the field 


In this section we find the transformation formulas for fields, that is, formulas by means 
of which we can determine the field in one inertial system of reference, knowing the same 
field in another system. 

The formulas for transformation of the potentials are obtained directly from the general 


formulas for transformation of four-vectors (6.1). Remembering that 4‘ = (ó, A), we get 
easily 


V V 
$ +- A, A,+— ¢ 
c c 


= TE A,= ee y= Ay, A, = Az: (24.1) 
]1- — er 
c? l c? 


The transformation formulas for an antisymmetric second-rank tensor (like F'*) were 
found in problem 2 of § 6: the components F?? and F?! do not change, while the com- 
ponents F??, F°, and F!?, F!? transform like x? and x!, respectively. Expressing the 
components of F* in terms of the components of the fields E and H, according to (23.5), 
we then find the following formulas of transformation for the electric field: 


V V 
E,- os H; E, — = H, 
E, = E, E, = ; E, 2 —— j (24.2) 
Ja Xe 
c? c 
and for the magnetic field: 
] H,— — E; H; + — E, 
c 
H, = H, H, = —— H, = ———-. (24.3) 


y? t y? 
Re eu 
c N c 


Thus the electric and magnetic fields, like the majority of physical quantities, are relative; 
that is, their properties are different in different reference systems. In particular, the electric 
or the magnetic field can be equal to zero in one reference system and at the same time be 
present in another system. 


The formulas (24.2), (24.3) simplify considerably for the case V < c. To terms of order 
Vic, we have: 


V 
E, = E,, E, = E,+ "E E, = E,— r Hj; 
c 


V V 
H, = Hy, H, = H,- ~ Ey, H, = H+ Z Ey 


These formulas can be written in vector form 


1 
E-E«-HXxV,HeHW-iEXxV. (24.4) 
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The formulas for the inverse transformation from K' to K are obtained directly from 
(24.2)-(24.4) by changing the sign of V and shifting the prime. 

If the magnetic field H' = 0 in the K' system, then, as we easily verify on the basis of 
(24.2) and (24.3), the following relation exists between the electric and magnetic fields in 
the K system: 


1 
H = Pad x E. (24.5) 
If in the K' system, E' = 0, then in the K system 


E- -iVxH. (24.6) 


Consequently, in both cases, in the K system the magnetic and electric fields are mutually 
perpendicular. 

These formulas also have a significance when used in the reverse direction: if the fields E 
and H are mutually perpendicular (but not equal in magnitude) in some reference system K, 
then there exists a reference system K' in which the field is pure electric or pure magnetic. 
The velocity V of this system (relative to K) is perpendicular to E and H and equal in 
magnitude to cH/E in the first case (where we must have H « E) and to cE/H in the second 
case (where E < H). 


$ 25. Invariants of the field 


From the electric and magnetic field intensities we can form invariant quantities, which 
remain unchanged in the transition from one inertial reference system to another. 

The form of these invariants is easily found starting from the four-dimensional representa- 
tion of the field using the antisymmetric four-tensor F^. It is obvious that we can form the 
following invariant quantities from the components of this tensor: 

F,,F* = inv, (25.1) 

emp „Fim inv, (25.2) 
where e'*" is the completely antisymmetric unit tensor of the fourth rank (cf. § 6). The first 
quantity is a scalar, while the second is a pseudoscalar (the product of the tensor F** with its 
dual tensor.t 

Expressing F* in terms of the components of E and H using (23.5), it is easily shown that, 
in three-dimensional form, these invariants have the form: 

H? —E? = inv, (25.3) 

E-H = inv. (25.4) 

The pseudoscalar character of the second of these is here apparent from the fact that it is the 

product of the polar vector E with the axial vector H (whereas its square (E: H)? is a true 
scalar). 


t We also note that the pseudoscalar (25.2) can also be expressed as a four-divergence: 
d d 
en Fie Fim = 4 »x (ena. xi An), 


as can be easily verified by using the antisymmetry of e*'™. 
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From the invariance of the two expressions presented, we get the following theorems. If 
the electric and magnetic fields are mutually perpendicular in any reference system, that is, 
E-H = 0, then they are also perpendicular in every other inertial reference system. If the 
absolute values of E and H are equal to each other in any reference system, then they are the 
same in any other system. 

The following inequalities are also clearly valid. If in any reference system E> H (or 
H > E), then in every other system we will have E> H (or H > E). If in any system of 
reference the vectors E and H make an acute (or obtuse) angle, then they will make an acute 
(or obtuse) angle in every other reference system. 

By means of a Lorentz transformation we can always give E and H any arbitrary values, 
subject only to the condition that E? — H? and E-H have fixed values. In particular, we 
can always find an inertial system in which the electric and magnetic fields are parallel to 
each other at a given point. In this system E- H — EH, and from the two equations 


E-H'-El-Hl EH =E, Ho. 


we can find the values of E and H in this system of reference (Eg and H, are the electric and 
magnetic fields in the original system of reference). 

The case where both invariants are zero is excluded. In this case, E and H are equal and 
mutually perpendicular in all reference systems. 

If E- H = 0, then we can always find a reference system in which E = 0 or H = 0 (accord- 
ing as E?— H? < or » 0), that is, the field is purely magnetic or purely electric. Con- 
versely, if in any reference system E = 0 or H = 0, then they are mutually perpendicular in 
every other system, in accordance with the statement at the end of the preceding section. 

We shall give still another approach to the problem of finding the invariants of an anti- 
symmetric four-tensor. From this method we shall, in párticular, see that (25.3-4) are 
actually the only two independent invariants and at the same time we will explain some 
instructive mathematical properties of the Lorentz transformations when applied to such 
a four-tensor. 

Let us consider the complex vector 


F =E+iH. (25.5) 


Using formulas (24.2-3), it is easy to see that a Lorentz transformation (along the x axis) 
for this vector has the form 


F, = Fy, F, = F, cosh ġ— iF; sinh ó = F, cos iġ—F; sin ig. 
F, = F; cos iġ +F, sin ig, tanh $ = 4 (25.6) 

We see thata rotation in the x, ¢ plane in four-space (which is what this Lorentz transforma- 
tion is) for the vector F is equivalent to a rotation in the v, z plane through an imaginary 
angle in three-dimensional space. The set of all possible rotations in four-space (including 
also the simple rotations around the x, y, and z axes) is equivalent to the set of all possible 
rotations, through complex angles in three-dimensional space (where the six angles of 
rotation in four-space correspond to the three complex angles of rotation of the three- 
dimensional system). 

The only invariant of a vector with respect to rotation is its square: F? = E?— H? + 
+2i E-H; thus the real quantities E?— H? and E-H are the only two independent 
invariants of the tensor Fy. 
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If F? # 0, the vector F can be written as F = an, where nis a complex unit vector (n? = 1). 
By a suitable compicx rotation we can point n along one of the coordinate axes; it is clear 
that then n becomes real and determines the directions of the two vectors E and H: 
F = (£+iH)n; in other words we get the result that E and H become parallel to one 
another. 


PROBLEM 


Determine the velocity of the system of reference in which the electric and magnetic fields are 
parallel, 

Solution: Systems of reference K', satisfying the required condition, exist in infinite numbers. If 
we have found one such, then the same property will be had by any other system moving relative 
to the first with its velocity directed along the common direction of E and H. Therefore it is sufficient 
to find one of these systems which has a velocity perpendicular to both fields. Choosing the 
direction of the velocity as the x axis, and making use of the fact that in K’: E= H,-—0, 
E, H,—E,H, — 0, we obtain with the aid of formulas (24.2) and (24.3) for the velocity V of the 
K' system relative to the original system the following equation: 


(we must choose that root of the quadratic equation for which V « c). 


CHAPTER 4 


THE ELECTROMAGNETIC FIELD EQUATIONS 


§ 26. The first pair of Maxwell’s equations 
From the expressions 


] ĝA 
H = curl A, = — -— — grad ¢ 
c 
it is easy to obtain equations containing only E and H. To do this we find curl E: 


18 
curl E = — - — curl A — curl grad 9. 


cat 
But the curl of any gradient is zero. Consequently, 
1 ôH 
curl E = — -—-. (26.1) 
c ôt 


Taking the divergence of both sides of the equation curl A = H, and recalling that div 
curl = 0, we find 
div H = 0. (26.2) 
The equations (26.1) and (26.2) are called the first pair of Maxwell's equations.t We note 
that these two equations still do not completely determine the properties of the fields. This is 
clear from the fact that they determine the change of the magnetic field with time (the 
derivative OH/Ot), but do not determine the derivative CE/t. 
Equations (26.1) and (26.2) can be written in integral form. According to Gauss' theorem 


f div Bav = $ H-a, 


where the integral on the right goes over the entire closed surface surrounding the volume 
over which the integral on the left is extended. On the basis of (26.2), we have 


$H: df — 0. (26.3) 


The integral of a vector over a surface is called the flux of the vector through the surface. 
Thus the flux of the magnetic field through every closed surface is zero. 
According to Stokes’ theorem, 
f eut E-at = $ E-dl, 
where the integral on the right is taken over the closed contour bounding the surface over 


t Maxwell’s equations (the fundamental equations of electrodynamics) were first formulated by him in 
the 1860's. 
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which the left side is integrated. From (26.1) we find, integrating both sides for any surface, 


16 
D d| = — TILES (26.4) 


The integral of a vector over a closed contour is called the circulation of the vector around 
the contour. The circulation of the electric field is also called the electromotive force in the 
given contour. Thus the electromotive force in any contour is equal to minus the time 
derivative of the magnetic flux through a surface bounded by this contour. 

The Maxwell equations (26.1) and (26.2) can be expressed in four-dimensional notation. 
Using the definition of the electromagnetic field tensor 


Fi, = 0A,JOx' —0A,/Ox*, 
it is easy to verify that 
OF, OF, OF 
w 0M LOI Log. (26.5) 


ax! ^ ax! — ax* 


The expression on the left is a tensor of third rank, which is antisymmetric in all three indices. 
The only components which are not identically zero are those with i 4 k # l. Thus there are 
altogether four different equations which we can easily show [by substituting from (23.5)] 
coincide with equations (26.1) and (26.2). 

We can construct the four-vector which is dual to this antisymmetric four-tensor of rank 
three by multiplying the tensor by e'*'" and contracting on three pairs of indices (see § 6). 
Thus (26.5) can be written in the form 
OF im 


ikim 
e 
àx* 


- 0, (26.6) 


which shows explicitly that there are only four independent equations. 


8 27. The action function of the electromagnetic field 


The action function S for the whole system, consisting of an electromagnetic field as well 
as the particles located in it, must consist of three parts: 


S = S,tSn+Smys (27.1) 


where S,, is that part of the action which depends only on the properties of the particles, 
that is, just the action for free particles. For a single free particle, it is given by (8.1). If there 
are several particles, then their total action is the sum of the actions for each of the individual 
particles. Thus, 


Sn = — Y mc | ds. (27.2) 


The quantity S,, is that part of the action which depends on the interaction between 
the particles and the field. According to § 16, we have for a system of particles: 


Some : Í A,dx*: (27.3) 
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In each term of this sum, A, is the potential of the field at that point of spacetime at which 
the corresponding particle is located. The sum Sm + Smp is already familiar to us as the action 
(16.1) for charges in a field. 

Finally S, is that part of the action which depends only on the properties of the field itself, 
that is, S, is the action for a field in the absence of charges. Up to now, because we were 
interested only in the motion of charges in a given electromagnetic field, the quantity S,, 
which does not depend on the particles, did not concern us, since this term cannot affect 
the motion of the particles. Nevertheless this term is necessary when we want to find 
equations determining the field itself. This corresponds to the fact that from the parts 
Smt Smp of the action we found only two equations for the field, (26.1) and (26.2), which 
are not yet sufficient for complete determination of the field. 

To establish the form of the action S, for the field, we start from the following very 
important property of electromagnetic fields. As experiment shows, the electromagnetic field 
satisfies the so-called principle of superposition. This principle consists in the statement that 
the field produced by a system of charges is the result of a simple composition of the fields 
produced by each of the particles individually. This means that the resultant field intensity 
at each point is equal to the vector sum of the individual field intensities at that point. 

Every solution of the field equations gives a field that can exist in nature. According to the 
principle of superposition, the sum of any such fields must be a field that can exist in nature, 
that is, must satisfy the field equations. 

As is well known, linear differential equations have just this property, that the sum of any 
solutions is also a solution. Consequently the field equations must be linear differential 
equations. 

From the discussion, it follows that under the integral sign for the action S, there must 
stand an expression quadratic in the field. Only in this case will the field equations be linear; 
the field equations are obtained by varying the action, and in the variation the degree of the 
expression under the integral sign decreases by unity. 

The potentials cannot enter into the expression for the action S,, since they are not 
uniquely determined (in S, , this lack of uniqueness was not important). Therefore S, must 
be the integral of some function of the electromagnetic field tensor F;,. But the action must 
be a scalar and must therefore be the integral of some scalar. The only such quantity is the 
product F F**.t 

Thus S, must have the form: 


$5 a | [ FyF*avar, dV — dx dy dz, 


where the integral extends over all of space and the time between two given moments; a is 
some constant. Under the integral stands F, F'* = 2(H? — E?). The field E contains the 
derivative QA/Ot; but it is easy to see that (0A/0t)? must appear in the action with the 
positive sign (and therefore E? must have a positive sign). For if (0A/0t)? appeared in S, 


t The function in the integrand of S, must not include derivatives of F;,, since the Lagrangian can contain, 
aside from the coordinates, only their first time derivatives. The role of “coordinates” (i.e., parameters to be 
varied in the principle of least action) is in this case played by the field potential 4x; this is analogous to the 
situation in mechanics where the Lagrangian of a mechanical system contains only the coordinates of the 
particles and their first time derivatives. 

As for the quantity e'™F, Fim (§ 25), as pointed out in the footnote on p. 63, it is a complete four- 
divergence, so that adding it to the integrand in S, would have no effect on the “equations of motion". It is 
interesting that this quantity is already excluded from the action for a reason independent of the fact that it is 
a pseudoscalar and not a true scalar. 
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with a minus sign, then sufficiently rapid change of the potential with time (in the time 
interval under consideration) could always make S, a negative quantity with arbitrarily 
large absolute value. Consequently S, could not have a minimum, as is required by the 
principle of least action. Thus, a must be negative. 

The numerical value of a depends on the choice of units for measurement of the field. 
We note that after the choice of a definite value for a and for the units of measurement of 
field, the units for measurement of all other electromagnetic quantities are determined. 

From now on we shall use the Gaussian system of units; in this system a is a dimension- 
less quantity, equal to —(1/167).+ 

Thus the action for the field has the form 


1 
16zc 


In three-dimensional form: 


S,- F F*dQ, dQ = cdtdx dy dz. (27.4) 


1 
S; = às | (E? — H?) dVdt. (27.5) 
In other words, the Lagrangian for the field is 
1 
L=; | &-8av. (27.6) 


The action for field plus particles has the form 


--y Í meds— Y, Í : A, di* — — Í F,, F*dQ. (27.7) 


Weernphasize that now the charges are not assumed to be small, as in the derivation of the 
equation of motion of a charge in a given field. Therefore A, and F;, refer to the actual field, 
that is, the external field plus the field produced by the particles themselves; 4, and F;, now 
depend on the positions and velocities of the charges. 


$ 28. The four-dimensional current vector 


Instead of treating charges as points, for mathematical convenience we frequently 
consider them to be distributed continuously in space. Then we can introduce the “charge 
density" o such that odV is the charge contained in the volume dV. The density g is in general 
a function of the coordinates and the time. The integral of ọ over a certain volume is the 
charge contained in that volume. 

Here we must remember that charges are actually pointlike, so that the density o is zero 
everywhere except at points where the point charges are located, and the integral f eav 
must be equal to the sum of the charges contained in the given volume. Therefore ọ can be 


t In addition to the Gaussian system, one also uses the Heaviside system, in which a= —4. In this 
system of units the field equations have a more convenient form (4x does not appear) but on the other 
hand, z appears in the Coulomb law. Conversely, in the Gaussian system the field equations contain 4z, but 
the Coulomb law has a simple form. 
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expressed with the help of the ó-function in the following formt: 


e = > e,Ó(r—r,) (28.1) 


where the sum goes over all the charges and r, is the radius vector of the charge e,. 

The charge on a particle is, from its very definition, an invariant quantity, that is, it does 
not depend on the choice of reference system. On the other hand, the density g is not generally 
an invariant—only the product odV is invariant. 

Multiplying the equality de = odV on both sides with dx’: 

d i 
de dx! = odVdx! = odV dt A 
On the left stands a four-vector (since de is a scalar and dx' is a four-vector). This means 
that the right side must be a four-vector. But dV dt is a scalar, and so o(dx'/dr) is a four- 
vector. This vector (we denote it by j^) is called the current four-vector: 
dx! 
j-g-—-. 28.2 
Jen (28.2) 


The space components of this vector form the current density vector, 


j= ev. (28.3) 
where v is the velocity of the charge at the given point. The time component of the four- 


t The ó-function d(x) is defined as follows: d(x) = 0, for all nonzero values of x; for x = 0, 5(0) = œ, in 
such a way that the integral 


+o 
f 5(x) dx — 1. (1) 
From this definition there result the following properties: if f(x) is any continuous function, then 
+0 
f f(x) 6(x — a) dx = f(a), aD 
and in particular, 
f fx) 6(x) dx =f(0). (III) 


(The limits of integration, it is understood, need not be + ©; the range of integration can be arbitrary, 
provided it includes the point at which the ó-function does not vanish.) 

The meaning of the following equalities is that the left and right sides give the same result when introduced 
as factors under an integral sign: 


&-—x)-», (ax) = ial &(9. av) 
The last equality is a special case of the more general relation 


1 
ó[ó(x)] = È PN é(x—a), (V) 


where ¢(x) is a single-valued function (whose inverse need not be single-valued) and the a, are the roots of 
the equation g(x) = 0. 

Just as 6(x) was defined for one variable x, we can introduce a three-dimensional ó-function, ó(r), equal to 
zero everywhere except at the origin of the three-dimensional coordinate system, and whose integral over all 
space is unity. As such a function we can clearly use the product ó(x) d(y) ó(z). 
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vector (28.2) is co. Thus 
JÏ = (co, p. (28.4) 


The total charge present in all of space is equal to the integral j gdV over all space. We 
can write this integral in four-dimensional form: 


] 1 ; 
f gdV = z [pav =" fras, (28.5) 


where the integral is taken over the entire four-dimensional hyperplane perpendicular to the 
x? axis (clearly this integration means integration over the whole three-dimensional space). 
Generally, the integral 

if, 

- | dS; 

c 


over an arbitrary hypersurface is the sum of the charges whose world lines pass through this 
surface. 

Let us introduce the current four-vector into the expression (27.7) for the action and 
transform the second term in that expression. Introducing in place of the point charges e a 
continuous distribution of charge with density o, we must write this term as 

_! Í oA, dx'dV, 
c 
replacing the sum over the charges by an integral over the whole volume. Rewriting in the 
form 


we see that this term is equal to 


1 TEC 
— ci f Ajj aQ. 


Thus the action S takes the form 


l i 1 ik 
S= -¥ | meds- 5 [ Ajao- is | FuF dQ. (28.6) 


§ 29. The equation of continuity 


The change with time of the charge contained in a certain volume is determined by the 


derivative 
ô f dv 
ôt eae 


On the other hand, the change in unit time, say, is determined by the quantity of charge 
which in unit time leaves the volume and goes to the outside or, conversely, passes to its 
interior. The quantity of charge which passes in unit time through the element df of the 
surface bounding our volume is equal to gv: df, where v is the velocity of the charge at the 
point in space where the element df is located. The vector df is directed, as always, along 
the external normal to the surface, that is, along the normal toward the outside of the volume 
under consideration. Therefore ov : df is positive if charge leaves the volume, and negative 
if charge enters the volume. The total amount of charge leaving the given volume per 
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unit time is consequently $ ov- df, where the integral extends over the whole of the closed 
surface bounding the volume. 
From the equality of these two expressions, we get 


ð 
FILE - $ ovat (29.1) 


The minus sign appears on the right, since the left side is positive if the total charge in the 
given volume increases. The equation (29.1) is the so-called equation of continuity, expressing 
the conservation of charge in integral form. Noting that gv is the current density, we can 
rewrite (29.1) in the form 

1 


G , 
S[ev--óra. (29.2) 

We also write this equation in differential form. To do this we apply Gauss' theorem to 
(29.2): 


$ j-df = | diviav. 


0p 
divj+ — )dV =0. 
f( 1v jc x) 


Since this must hold for integration over an arbitrary volume, the integrand must be zero: 


and we find 


2.10 
div j+ 2 = 0. (29.3) 
ot 


This is the equation of continuity in differential form. 

It is easy to check that the expression (28.1) for @ in ó-function form automatically 
satisfies the equation (29.3). For simplicity we assume that we have altogether only one 
charge, so that 

o = eô (r— rg). 
The current j is then 

j= ev ôlr — ro), 
where v is the velocity of the charge. We determine the derivative 0o/6t. During the motion 
of the charge its coordinates change, that is, the vector rọ changes. Therefore 


ðo ĉo Oo 
at Org Ot 
But órg/ót is just the velocity v of the charge. Furthermore, since o is a function of r—ro, 
ĉo ĉo 
órj ðr 
Consequently 
ĉo . 
— = —v: grad 9 = —div (ov) 
at 
(the velocity v of the charge of course does not depend on r). Thus we arrive at the equation 
(29.3). 


It is easily verified that, in four-dimensional form, the continuity equation (29.3) is 
expressed by the statement that the four-divergence of the current four-vector is zero: 


Z =0, (29.4) 
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In the preceding section we saw that the total charge present in all of space can be written 


as 
1 : 
= fras, 
c 


where the integration is extended over the hyperplane x? = const. At each moment of time, 
the total charge is given by such an integral taken over a different hyperplane perpendicular 
to the x? axis. It is easy to verify that the equation (29.4) actually leads to conservation of 
charge, that ts, to the result that the integral fj/dS; is the same no matter what hyperplane 
x? = const we integrate over. The difference between the integrals f j'dS; taken over two 
such hyperplanes can be written in the form $ j'dS,, where the integral is taken over the 
whole closed hypersurface surrounding the four-volume between the two hyperplanes under 
consideration (this integral differs from the required integral because of the presence of the 
integral over the infinitely distant "sides" of the hypersurface which, however, drop out, 
since there are no charges at infinity). Using Gauss' theorem (6.15) we can transform this to 
an integral over the four-volume between the two hyperplanes and verify that 


e. [9 
$ jidS, = Í E dQ =0. (2.5) 


The proof presented clearly remains valid also for any two integrals fj'as,, in which 
the integration is extended over any two infinite hypersurfaces (and not just the hyperplanes 
x9 = const) which each contain all of three-dimensional space. From this it follows that 


the integral 
T NE 
«fons 
c 


is actually identical in value (and equal to the total charge in space) no matter over what 
such hypersurface the integration is taken. 

We have already mentioned (see the footnote on p. 50) the close connection between the 
gauge invariance of the equations of electrodynamics and the law of conservation of charge. 
Let us show this once again using the expression for the action in the form (28.6). On re- 
placing A; by 4;— (0f/0x'), the integral 

1 . 0 

3 Í j Zao 
is added to the second term in this expression. It is precisely the conservation of charge, as 
expressed in the continuity equation (29.4), that enables us to write the integrand as a four- 
divergence 6(fj')/dx', after which, using Gauss’ theorem, the integral over the four-volume 
is transformed into an integral over the bounding hypersurface; on varying the action, these 
integrals drop out and thus have no effect on the equations of motion. 


§ 30. The second pair of Maxwell equations 


In finding the field equations with the aid of the principle of least action we must assume 
the motion of the charges to be given and vary only the potentials (which serve as the 
“coordinates” of the system); on the other hand, to find the equations of motion we assumed 
the field to be given and varied the trajectory of the particle. 
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Therefore the variation of the first term in (28.6) is zero, and in the second we must not 
vary the current j*. Thus, 


ôS = — REZI: i a dQ = 0. 
cle 8x 


(where we have used the fact that F*óF;, = F,6F*). Substituting Fy, = 0A,/éx'—@Aj/ax* 
we have 
Ue. voe 1, ô 
Sa ga E — — F* — 64,$ dQ 
5S [ins go PhS bin 5 HS bah 


In the second term we interchange the indices i and k, over which the expressions are 
summed, and in addition replace F'* by — F*, Then we obtain 


si 1 ik C 
ös = - [rn a sox 84} dn 


The second of these integrals we integrate by parts, that is, we apply Gauss’ theorem: 
; aF” 
s=- pE vm dQ- | P4, 4S,. 
c 4n ax* 


In the second term we must insert the values at the limits of integration. The limits for the 
coordinates are at infinity, where the field is zero. At the limits of the time integration, that is, 
at the given initial and final time values, the variation of the potentials is zero, sincein accord 
with the principle of least action the potentials are given at these times. Thus the second term 


in (30.1) is zero, and we find 
1, 1 6F* 
fG t— ipw PLA dQ = 0. 


Since according to the principle of least action, the variations 6A, are arbitrary, the co- 
efficients of the 6A; must be set equal to zero: 
or* 4n 4 
su Te (30.2) 
Let us express these four (i = 0, 1, 2, 3) equations in three-dimensional form. For i = 1: 
oF}! p OFM OF! LaF" 4n , 
x © y Oz c ôt e. 
Su. stituting the values for the components of F'*, we find 
oH, 0H, 10E, n, 


oy Oz côt c 


This together with the two succeeding equations (i = 2, 3) can be written as one vector 
equation: 


(30.1) 


curl H = cor +—j. (30.3) 


Finally, the fourth equation (i = 0) gives 
div E = 4ng. (30.4) 
Equations (30.3) and (30.4) are the second pair of Maxwell equations.t Together with the 


t The Maxwell equations in a form applicable to point charges in the electromagnetic field in vacuum 
were formulated by H. A. Lorentz. 
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first pair of Maxwell equations they completely determine the electromagnetic field, and are 
the fundamental equations of the theory of such fields, i.e. of electrodynamics. 

Let us write these equations in integral form. Integrating (30.4) over a volume and 
applying Gauss' theorem 


f div Eav = $ E- at, 
we get 

$E-df = 4n | odV. (30.5) 
Thus the flux of the electric field through a closed surface is equal to 4r times the total charge 


contained in the volume bounded by the surface. 
Integrating (30.3) over an open surface and applying Stokes’ theorem 


f curl df = $ Hdl, 


“we find 
4n f. 
$n a-i E-de T [cat (30.6) 
The quantity 
1 GE 
——— 30.7 
4n ôt (30.7) 
is called the "displacement current". From (30.6) written in the form 
4n 1 GE 
H-dl= i+ df, 30.8 
$iea-T [eO z) 9n) 


we see that the circulation of the magnetic field around any contour is equal to 4z/c times 
the sum of the true current and displacement current passing through a surface bounded by 
this contour. 

From the Maxwell equations we can obtain the already familiar continuity equation (29.3). 
Taking the divergence of both sides of (30.3), we find 


div curl H = - div E+ — = divi. 


But div curl H = 0 and div E = 4zg, according to (04) Thus we arrive once more at 
equation (29.3). In four-dimensional form, from (30.2), we have: 


But when the operator 0?/0x'ox*, which is symmetric in the indices i and k, is applied to 
the antisymmetric tensor F', it gives zero identically and we arrive at the continuity 
equation (29.4) expressed in four-dimensional form. 


§ 31. Energy density and energy flux 


Let us multiply both sides of (30.3) by E and both sides of (26.1) by H and combine the 
resultant equations. Then we get 


Dg. E 1g. OH nj. E — (H -curl E-E: curl H). 
c Ot c a c 
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Using the well-known formula of vector analysis, 
div (a x b) = b: curl a—a-curlb, 
we rewrite this relation in the form 


L3. 2s 3 41. . 
t = — —j-E-div(ExH 
DEL +H’) 21 iv (Ex H) 


or 


à /E?+H? l 
= ( E )= 7i £- dis. (31.1) 
The vector 

= ExH (31.2) 


is called the Poynting vector. 
We integrate (31.1) over a volume and apply Gauss’ theorem to the second term on the 
right. Then we obtain 


ô ¢ E?+H? 
at Í 8x 

If the integral extends over all space, then the surface integral vanishes (the field is zero 
at infinity). Furthermore, we can express the integral [j-EdV as a sum £ ev- E over all 
the charges, and substitute from (17.7): 


dV = - [rsev- f s-at (31.3) 


d 
ev E = dt 6 kin- 
Then (31.3) becomes 
d E?+H? 
2 f PUE ave Y Fug} =O. (31.4) 


Thus for the closed system consisting of the electromagnetic field and particles present in 
it, the quantity in brackets in this equation is conserved. The second term in this expression 
is the kinetic energy (including the rest energy of all the particles; see the footnote on p. 48), 
the first term is consequently the energy of the field itself. We can therefore call the quantity 

E?+H? ae 
ae (31.5) 
the energy density of the electromagnetic field; it is the energy per unit volume of the field. 

If we integrate over any finite volume, then the surface integral in (31.3) generally does 
not vanish, so that we can write the equation in the form 


à E'-H? 
al = ave Y Sun} =- $s dt, (31.6) 


where now the second term in the brackets is summed only over the particles present in the 
volume under consideration. On the left stands the change in the total energy of field and 
particles per unit time. Therefore the integral $ S- df must be interpreted as the flux of 
field energy across the surface bounding the given volume, so that the Poynting vector S is 
this flux density—the amount of field energy passing through unit area of the surface in 
unit time.t 


t We assume that at the given moment there are no charges on the surface itself. If this were not the case, 
then on the right we would have to include the energy flux transported by particles passing through the 
surface. 


§ 32 THE ENERGY-MOMENTUM TENSOR 77 
§ 32. The energy-momentum tensor 


In the preceding section we derived an expression for the energy of the electromagnetic 
field. Now we derive this expression, together with one for the field momentum, in four- 
dimensional form. In doing this we shall for simplicity consider for the present an clectro- 
magnetic field without charges. Having in mind later applications (to the gravitational field), 
and also to simplify the calculation, we present the derivation in a general form, not 
specializing the nature of the system. So we consider any system whose action integral has 
the form 

ôq 1 
S= Í A (a. A dVdt=- Í AdQ, (32.1) 
ôx c 


where A is some function of the quantities g, describing the state of the system, and of their 
first derivatives with respect to coordinates and time (for the electromagnetic field the 
components of the four-potential are the quantities g); for brevity we write here only one 
of the q's. We note that the space integral f A dV is the Lagrangian of the system, so that A 
can be considered as the Lagrangian "density". The mathematical expression of the fact that 
the system is closed is the absence of any explicit dependence of A on the x‘, similarly to the 
situation for a closed system in mechanics, where the Lagrangian does not depend explicitly 
on the time. 

The “equations of motion” (i.e. the field equations, if we are dealing with some field) are 
obtained in accordance with the principle of least action by varying S. We have (for brevity 
we write q,; = £q/Ox ), 


CA O^ 
as= i | (E sat E oai) 
c J \éq ; 


ôq; 
1 Q^ ð / ôA Ó GA 
= ó : ôq }—6 : =Q. 
1 [ D Us ) 4 ay a d 


The second term in the integrand, after transformation by Gauss' theorem, vanishes upon 
integration over all space, and we then find the following “equations of motion”: 


2 ðA ôA 
pu ccu (322) 
Ox'Oq,; dq 


(it is, of course, understood that we sum over any repeated index). 
The remainder of the derivation is similar to the procedure in mechanics for deriving the 


conservation of energy. Namely, we write: 
O^ OAOq , O^ OG, 


Ox! dq Ox’ ðq, Ox 
Substituting (32.2) and noting that q,,, ; — q,;,,, we find 


CA O0 (O^ O^ ôq; ô oA 
aay = ck a i tO Oe = SR zu 
cq, x 


éx'  êx* \eq., ðq, Ox* — Ox 
On the other hand, we can write 
OA os oA 
Ox ax 
so that, introducing the notation 
gig 93 nia. (32.3) 
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we can express the relation in the form 
oT: 
x 
We note that if there is not one but several quantities q”, then in place of (32.3) we must 
write 


(32.4) 


-Ye I a —ól A. (32.5) 


But in § 29 we saw that an equation of the form 0A*/Ox* = 0, i.e. the vanishing of the 
four-divergence of a vector, is equivalent to the statement that the integral f A*dS, of the 
vector over a hypersurface which contains all of three-dimensional space is conserved. It is 
clear that an analogous result holds for the divergence of a tensor; the equation (32.4) 
asserts that the vector P! = const f T'*dS, is conserved. 

This vector must be identified with the four-vector of momentum of the system. We 
choose the constant factor in front of the integral so that, in accord with our previous 
definition, the time component P? is equal to the energy of the system multiplied by l/c. 
To do this we note that 


P? = const f TdS, = const f T°°dV 


if the integration is extended over the hyperplane x° = const. On the other hand, according 
to (32.3), 
T 2 q^ A. (a=) 


Comparing with the usual formulas relating the energy and the Lagrangian, we see that 
this quantity must be considered as the energy density of the system, and therefore f T°°dV 
is the total energy of the system. Thus we must set const = l/c, and we get finally for the 
four-momentum of the system the expression 


. 1 ; 
P= 3l TS. (32.6) 


The tensor 7'* is called the energy-momentum tensor of the system. 
It is necessary to point out that the definition of the tensor 7 is not unique. In fact, if 
T* is defined by (32.3), then any other tensor of the form 


T* + i yh, yu y" (32.7) 


will also satisfy equation (32.4), since we have identically d?7y'"/dx*éx! = 0. The total four- 
momentum of the system does not change, since according to (6.17) we can write 


ay" A ikl ikl 1 
as etf (S 2) =! [ver 


where the integration on the right side of the m is extended over the (ordinarv) surface 
which “bounds” the hypersurface over which the integration on the left is taken. This surface 
is clearly located at infinity in the three-dimensional space, and since neither field nor particles 
are present at infinity this integral is zero. Thus the four-momentum of the system is, as it 
must be, a uniquely determined quantity. To define the tensor 7* uniquely we can use the 
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requirement that the four-tensor of angular momentum (see § 14) of the system be expressed 
in terms of the four-momentum by 


, : 1 . F 
M*- I Gd Pt — x'ap^) = - I (xT! — x* TUS, (32.8) 


that is its "density" is expressed in terms of the "density" of momentum by the usual 
formula. 

It is easy to determine what conditions the energy-momentum tensor must satisfy in order 
that this be valid. We note that the law of conservation of angular momentum can be 
expressed, as we already know, by setting equal to zero the divergence of the expression under 
the integral sign in M*. Thus 


7 (xi T"! — x*T!) = 0. (32.9) 
x 


Noting that €x‘/0x' = ôi and that 0T*'/0x! = 0, we find from this 
SiT" —o T! = TH_T#* = 0 
or 
TEST (32.10) 
that is, the energy-momentum tensor must be symmetric. 

We note that 7“, defined by formula (32.5), is generally speaking not symmetric, but can 
be made so by transformation (32.7) with suitable py‘. Later on (§ 94) we shall see that 
there is a direct method for obtaining a symmetric tensor T*. 

As we mentioned above, 1f we carry out the integration in (32.6) over the hyperplane 
x? = const, then P! takes on the form 


Pi = : | TV, (32.11) 
c 


where the integration extends over the whole (three-dimensional) space. The space com- 
ponents of P' form the three-dimensional momentum vector of the system and the time 
component is its energy multiplied by 1/c. Thus the vector with components 


may be called the “momentum density", and the quantity 
W- T" 
the "energy density". 
To clarify the meaning of the remaining components of T'*, we separate the conservation 
equation (32.4) into space and time parts: 
1 ôT” T” 1 ôT oT” 
-— bà -=+ 
c ôt Ox" c ôt Óx 
We integrate these equations over a volume V in space. From the first equation 
10 ôT” 
sgl eav | ~dV =0 
c ôt ex 
or, transforming the second integral by Gauss’ theorem, 


: f T°°dV = —c $ T?*df,, (32.13) 


= (0). (32.12) 


80 THE ELECTROMAGNETIC FIELD EQUATIONS § 33 


where the integral on the right is taken over the surface surrounding the volume V (df,, 
df,, df, are the components of the three-vector of the surface element df). The expression on 
the left is the rate of change of the energy contained in the volume V; from this it is clear 
that the expression on the right is the amount of energy transferred across the boundary of 
the volume V, and the vector S with components 


cT?!, cT??, cT? 


is its flux density—the amount of energy passing through unit surface in unit time. Thus we 
arrive at the important conclusion that the requirements of relativistic invariance, as 
expressed by the tensor character of the quantities 7**, automatically lead to a definite 
connection between the energy flux and the momentum density: the energy flux density 
is equal to the momentum density multiplied by c?. 

From the second equation in (32.12) we find similarly: 


2 Í i T*94y = — $ T"^qf,. (32.14) 
ot Jc 
On the left is the change of the momentum of the system in volume V per unit time, therefore 
$ T” df, is the momentum emerging from the volume V per unit time. Thus the components 
T* of the energy-momentum tensor constitute the three-dimensional tensor of momentum 
flux density; we denote it by —6,5, where o,, is the stress tensor. The energy flux density 
is a vector; the density of flux of momentum, which is itself a vector, must obviously be a 
tensor (the component T, of this tensor is the amount of the a-component of the 
momentum passing per unit time through unit surface perpendicular to the x^ axis). 

We give a table indicating the meanings of the individual components of the energy- 
momentum tensor: 


W Sde Sde S,/c 


; S — — — 
pia [$e Fx Oy Fae | (32.15) 
S, je — Oys 70, m 
S.lc — 62 m 0;, — 0; 


§ 33. Energy-momentum tensor of the electromagnetic field 


We now apply the general relations obtained in the previous section to the electromagnetic 
field. For the electromagnetic field, the quantity standing under the integral sign in (32.1) is 
equal, according to (27.4), to 


The quantities q are the components of the four-potential of the field, A,, so that the definition 
(32.5) of the tensor T* becomes 
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To calculate the derivatives of A which appear here, we find the variation 5A. We have 


1 1 cA 0A 
ôA = — — F" ôF y = — — F" (s H § 
8n ki 8x àx* óx! 
or, interchanging indices and making use of the fact that Fy, = — Fi, 
1 0A 
6A = — — FUA T 
ax* 
From this we see that 
OA l Lu 
uy 
Óx 
and therefore 
1 8A, ba 
TE = — LL py EF", 
' 4n Ox! * lór | ^" 


According to the field equation (30.2) in the absence of charges, CF*/éx, = 0, and therefore 


1 04! 1 0,, 
LL F* -—— (AiF*5, 
4n ôx 
so that the change made in T*™ is of the form (32.7) and is admissible. Since 0A'/0x; — 0 A'/x, 
= F", we get finally the following expression for the energy-momentum tensor of the 
electromagnetic field: 


4n Óx, 


; 1 i i 
T* = az (ota g* Fin F”). (33.1) 


This tensor is obviously symmetric. In addition it has the property that 
Ti =0, (33.2) 


i.e. the sum of its diagonal terms is zero. 

Let us express the components of the tensor T* in terms of the electric and magnetic field 
intensities. By using the values (23.5) for the components F**, we easily verify that the 
quantity 79? coincides with the energy density (31.5), while the components cT ?* are the 
same as the components of the Poynting vector (31.2). The space components T* form 
a three-dimensional tensor with components 


1 


1 
oxy = ar (EE, + H,H,), 
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etc., or 
l 
747 1 {+ E,E, + H, Hy — 1o, (E? + H?)). (33.3) 


This tensor is called the Maxwell stress tensor. 

To bring the tensor T, to diagonal form, we must transform to a reference system in 
which the vectors E and H (at the given point in space and moment in time) are parallel to 
one another or where one of them is equal to zero; as we know (8 25), such a transformation 
is always possible except when E and H are mutually perpendicular and equal in magnitude. 
It is easy to see that after the transformation the only non-zero components of T^ will be 


T° = —T!! = T?? = T3? = Ww 


(the x axis has been taken along the direction of the field). 
But if the vectors E and H are mutually perpendicular and equal in magnitude, the tensor 
T cannot be brought to diagonal form.t The non-zero components in this case are 


T°? = T3? = T? = W 


(where the x axis is taken along the direction of E and tne y axis along H). 

Up to now we have considered fields in the absence of charges. When charged particles are 
present, the energy-momentum tensor of the whole system is the sum of the energy- 
momentum tensors for the electromagnetic field and for the particles, where in the latter the 
particles are assumed not to interact with one another. 

To determine the form of the energy-momentum tensor of the particles we must describe 
their mass distribution in space by using a "mass density" in the same way as we describe a 
distribution of point charges in terms of their density. Analogously to formula (28.1) for 
the charge density, we can write the mass density in the form 


u=} m, (r-r,), (33.4) 


where r, are the radius-vectors of the particles, and the summation extends over all the 
particles of the system. 

The “four-momentum density" of the particles is given by jicu;. We know that this density 
is the component T?*?/c of the energy-momentum tensor, i.e. T° = u cu? (a = 1.2, 3). But 
the mass density is the time component of the four-vector z/c (dx"/dt) (in analogy to the 
charge density; see $ 28). Therefore the energy-momentum tensor of the system of non- 
interacting particles is 


T" = pe 7 — = pucu u" = (33.5) 


As expected, this tensor is symmetric. 


We verify by a direct computation that the energy and momentum of the system, defined 
as the sum of the energies and momenta of field and particles, are actually conserved. In 
other words we shall verify the equations. 


x (T+ TO) = 0, (33.6) 
Xk 


which express these conservation laws. 


+ The fact that the reduction of the symmetric tensor 7 to principal axes may be impossible is related to 
the fact that the four-space is pseudo-euclidean. (See also the problem in § 94.) 
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Differentiating (33.1), we write 


OTD L (pn Fie _ pu OF p, 
dx* — 4n\2° OX! Ax" ox  # 
Substituting from the Maxwell equations (26.5) and (30.2), 
OF" 4n 4 OF im Fmi OF i 
àx c" Ox! |. Ox Ox" 
we have: 


üx* — 4nX 2 ax! 20x" ax* 
By permuting the indices, we easily show that the first three terms on the right cancel one 
another, and we arrive at the result: 


eT» 1 . 
= LE (33.7) 


eTUX* 1 lF,, 1 OF; oF; 4x | 
i ( LIS n Fin LIH pim Ltd FRE _ 2) 


OX, 

Differentiating the expression (33.5) for the energy-momentum tensor of the particles gives 
oT ô dx" dx* ðu; 
axt T i xk (, di ) ur 

The first term in this expression is zero because of the conservation of mass for non- 

interacting particles. In fact, the quantities u(dx*/dt) constitute the “mass current" four- 


vector, analogous to the charge current four-vector (28.2); the conservation of mass is 
expressed by equating to zero the divergence of this four-vector: 


ð dx* 
—l(u—)= 33.8 
Bx (u di ) 0, (33.8) 


just as the conservation of charge is expressed by equation (29.4). 
Thus we have: 


aT dx* du; du; 
Ox CI ae a I ge 

Next we use the equation of motion of the charges in the field, expressed in the four- 

dimensional form (23.4). 


Changing to continuous distributions of charge and mass, we have, from the definitions of 
the densities u and o: u/m = o/e. We can therefore write the equation of motion in the form 


du. 
c = =£ Fau" 
s 
or 
du; 1 ds 
HoT c ag wh = Faj" 
Thus, 
OTO | 
a = Fall (33.9) 


Combining this with (33.7), we find that we actually get zero, i.e. we arrive at equation 
(33.6). 
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PROBLEM 


Find the law of transformation of the energy density, the energy flux density, and the components 
of the stress tensor under a Lorentz transformation. 

Solution: Suppose that the K’ coordinate system moves relative to the K system along the x axis 
with velocity V. Applying the formulas of problem 1, §6 to the symmetric tensor T*, we find: 


1 „Yao Vu 
W =— W'S om > 


1 y? + + + 
S,= 2 Sub S;— VW Vo. | 
m 
1 , 
$,- (S, — Vo), 
m 
1 + V + y? + 
Oxx = z( Xx 2 i95 2 w) 
(d 
Oy, = Ty » CO, 0; Oy, 0, 
1 , V , 
On = ; Oya; 
arg 


and similar formulas for S, and o,,. 


§ 34. The virial theorem 


Since the sum of the diagonal terms of the energy-momentum tensor of the electro- 
magnetic field is equal to zero, the sura Ti for any system of interacting particles reduces to 
the trace of the energy-momentum tensor for the particles alone. Using (33.5), we therefore 
have: 


; d d 2 
Ti = T?! = pou,u' p = uc p = uc? Ji- T 


Let us rewrite this result, shifting to a summation over the particles, i.e. writing u as the sum 
(33.4). We then get finally: 


Ti- Smet fi- 556r) G4.) 
We note that, according to this formula, we have for every system: 
Ti 20, (34.2) 


where the equality sign holds only for the electromagnetic field without charges. 
Let us consider a closed system of charged particles carrying out a finite motion, in which 
all the quantities (coordinates, momenta) characterizing the system vary over finite ranges.t 


t Here we also assume that the electromagnetic field of the system goes to zero sufficiently rapidly at infinity. 
In specific cases this condition may require the neglect of radiation of electromagnetic waves by the system. 
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We average the equation 
1 ôT? " aT? 
c ôt ax? 


[see (32.11)] with respect to the time. The average of the derivative 0T"9/0t, like the average 
of the derivative of any bounded quantity, is zero.t Therefore we get 


ó — 

L Tha 

5x8 T3 =0. 

We multiply this equation by x* and integrate over all space. We transform the integral by 
Gauss’ theorem, keeping in mind that at infinity T7 = 0, and so the surface integral vanishes: 


=0 


oT? xt. pm 
f dv=- | Thay = - | sra =0, 
or finally, 
| T?dV =0. (34.3) 


On the basis of this equality we can write for the integral of Ti = T*4 T9: 
| Tiav = [ Tav =, 
where @ is the total energy of the system. 
Finally, substituting (34.1) we get: 


2 
&-Yymc "i 1- a (34.4) 


This relation is the relativistic generalization of the virial theorem of classical mechanics. 
(See Mechanics, §10.) For low velocities, it becomes 


2 

£-} m,e = ayes, 
a a 2 

that is, the total energy (minus the rest energy) is equal to the negative of the average value 

of the kinetic energy—in agreement with the result given by the classical virial theorem for a 

system of charged particles (interacting according to the Coulomb law). 

We must point out that our formulas have a quite formal character and need to be made 
more precise. The point is that the electromagnetic field energy contains terms that give an 
infinite contribution to the electromagnetic self-energy of point charges (see $37). To give 
meaning to the corresponding expressions we should omit these terms, considering that the 
intrinsic electromagnetic energy is already included in the kinetic energy of the particle (9.4). 
This means that we should *renormalize" the energy making the replacement 

2 2 
say [Pay 


+ Let f(t) be such a quantity. Then the average value of the derivative df/dt over a certain time interval T is 


- T 
dí Y Y u ST) —f(0) 
d TJ dt T = 
o 
Since f(t) varies only within finite limits, then as T increases without limit, the average value of df/dt clearly 
goes to zero. 
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in (34.4), where E, and H, are the fields produced by the a’th particle. Similarly in (34.3) 
we should make the replacementt 


E? + H? 
[rav | riv +y av 


$ 35. The energy-momentum tensor for macroscopic bodies 


In addition to the energy-momentum tensor for a system of point particles (33.5), we shall 
also need the expression for this tensor for macroscopic bodies which are treated as being 
continuous. 

The flux of momentum through the element df of the surface of the body is just the force 
acting on this surface element. Therefore —c,; df; is the «-component of the force acting 
on the element. Now we introduce a reference system in which a given element of volume 
of the body is at rest. In such a reference system, Pascal's law is valid, that is, the pressure p 
applied to a given portion of the body is transmitted equally in all directions and is every- 
where perpendicular to the surface on which it acts.] Therefore we can write o,, df, = 
— pdf, , so that the stress tensor is o, = — pó,s. As for the components T”, which represent 
the momentum density, they are equal to zero for the given volume element in the reference 
system we are using. The component T®” is as always the energy density of the body, which 
we denote by £; £/c? is then the mass density of the body, i.e. the mass per unit volume. 
We emphasize that we are talking here about the unit "proper" volume, that is, the volume 
in the reference system in which the given portion of the body is at rest. 

Thus, in the reference system under consideration, the energy-momentum tensor (for the 
given portion of the body) has the form: 


T*- (35.1) 


oom 


0 
0 
p 


ooo 


0 
P 
0 
00 0 p 
Now it is easy to find the expression for the energy-momentum tensor in an arbitrary 
reference system. To do this we introduce the four-velocity u‘ for the macroscopic motion 
of an element of volume of the body. In the rest frame of the particular element, u° = (1, 0). 
The expression for T* must be chosen so that in this reference system it takes on the form 
(35.1). It is easy to verify that this is 
T* = (p+e)uiu*— pg, (35.2) 
or, for the mixed components, 
Ti = (p+ e)uju* — pôi. 


This expression gives the energy-momentum tensor for a macroscopic body. The 


. . , E; + H; Mba , 
T Note that without this change the expression [ro IE ——— dV Y —— is essentially 
positive and cannot vanish. 8n a Ji- vic? 

f Strictly speaking, Pascal’s law is valid only for liquids and gases. However, for solid bodies the maximum 
possible difference in the stress in different directions is negligible in comparison with the stresses which can play 


a role in the theory of relativity, so that its consideration is of no interest. 
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expressions for the energy density W, energy flow vector S and stress tensor c,, are: 


v? 
e+p > 
c (pt+e)v 
W= v' S= v' 
1—— 1—--; 
c? c? (35.3) 


If the velocity v of the macroscopic motion is small compared with the velocity of light, then 
we have approximately: 

S =(p+t+e)v. 
Since S/c? is the momentum density, we see that in this case the sum (p-c)/c? plays the 
role of the mass density of the body. 

The expression for T* simplifies in the case where the velocities of all the particles making 
up the body are small compared with the velocity of light (the velocity of the macroscopic 
motion itself can be arbitrary). In this case we can neglect, in the energy density e, all terms 
small compared with the rest energy, that is, we can write pg c? in place of c, where po is 
the sum of the masses of the particles present in unit (proper) volume of the body (we 
emphasize that in the general case, jj, must differ from the actual mass density e/c? of the 
body, which includes also the mass corresponding to the energy of microscopic motion of 
the particles in the body and the energy of their interactions). As for the pressure determined 
by the energy of microscopic motion of the molecules, in the case under consideration it is 
also clearly small compared with the rest energy jt, c?. Thus we find 

T* = uo c^ulw*. (35.4) 

From the expression (35.2), we get 

Ti =e—3p. (35.5) 

The general property (34.2) of the energy-momentum tensor of an arbitrary system now 

shows that the following inequality is always valid for the pressure and density of a macro- 
scopic body: 

p< > (35.6) 

Let us compare the relation (35.5) with the general formula (34.1) which we saw was valid 
for an arbitrary system. Since we are at present considering a macroscopic body, the expres- 
sion (34.1) must be averaged over all the values of r in unit volume. We obtain the result 


2 
e~3p = Y mc NIE (35.7) 


(the summation extends over all particles in unit volume). 
The right side of this equation tends to zero in the ultrarelativistic limit, so in this limit 
the equation of state of matter is:t 


peus (35.8) 

t This limiting equation of state is obtained here assuming an electromagnetic interaction between the 

particles. We shall assume (when this is needed in Chapter 14) that it remains valid for the other possible 
interactions between particles, though there is at present no proof of this assumption. 
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We apply our formula to an ideal gas, which we assume to consist of identical particles. 
since the particles of an ideal gas do not interact with one another, we can use formula 
(33.5) after averaging it. Thus for an ideal gas, 

T^ dx! dx* 
= nme — — 
dt ds’ 
where n is the number of particles in unit volume and the dash means an average over all the 
particles. If there is no macroscopic motion in the gas then we can use for T the expression 
(35.1). Comparing the two formulas, we arrive at the equations: 


c? nm v? 


"wp Pa al 
e a 


These equations determine the density and pressure of a relativistic ideal gas in terms of the 


e= nm 


(35.9) 


velocity of its particles; the second of these replaces the well-known formula p = nmv?/3 of 
the nonrelativistic kinetic theory of gases. 


CHAPTER 5 


CONSTANT ELECTROMAGNETIC FIELDS 


§ 36. Coulomb’s law 


For a constant electric, or as it is usually called, electrostatic field, the Maxwell equations 
have the form: 


div E = 472p, (36.1) 

curl E = 0. (36.2) 
The electric field E is expressed in terms of the scalar potential alone by the relation 

E = — grad ¢. (36.3) 


Substituting (36.3) in (36.1), we get the equauon which is satisfied by the potential of a 


constant electric field: 
A$ = —4np. (36.4) 


This equation is called the Poisson equation. In particular, in vacuum, i.e., for ọ = 0, the 
potential satisfies the Laplace equation 


Ag =0. (36.5) 


From the last equation it follows, in particular, that the potential of the electric field can 
nowhere have a maximum or a minimum. For in order that $ have an extreme value, it 
would be necessary that the first derivatives of ¢ with respect to the coordinates be zero, 
and that the second derivatives 02$/0x?, 0?$/0y?, 0?/0z? all have the same sign. The last 
is impossible, since in that case (36.5) could not be satisfied. 

We now determine the field produced by a point charge. From symmetry considerations, 
it is clear that it is directed along the radius-vector from the point at which the charge e is 
located. From the same consideration it is clear that the value E of the field depends only on 
the distance R from the charge. To find this absolute value, we apply equation (36.1) in the 
integral form (30.5). The flux of the electric field through a spherical surface of radius R 
circumscribed around the charge e is equal to 4x RE; this flux must equal 4ne. From this we 


get 


e 
E = RE 
In vector notation: 
R 
E= : (36.6) 


Thus the field produced by a point charge is inversely proportional to the square of the 
89 
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distance from the charge. This is the Coulomb law. The potential of this field is, clearly, 
e 
=-, 36.7 
$-5 (36.7) 


If we have a system of charges, then the field produced by this system is equal, according 
to the principle of superposition, to the sum of the fields produced by each of the particles 
individually. In particular, the potential of such a field is 


where R, is the distance from the charge e, to the point at which we are determining the 
potential. If we introduce the charge density o, this formula takes on the form 


E s 
$- Í gV (36.8) 


where R is the distance from the volume element dV to the given point of the field. 
We note a mathematical relation which is obtained from (36.4) by substituting the values 
of e and ¢ for a point charge, i.e. o = e 6(R) and ¢ = e/R. We then find 


A (s) = —4n 6(R). (36.9) 


§ 37. Electrostatic energy of charges 


We determine the energy of a system of charges. We start from the energy of the field, that 
is, from the expression (31.5) for the energy density. Namely, the energy of the system of 
charges must be equal to 


porn 
=— | E*dV, 
8x | 


where E is the field produced by these charges, and the integral goes over all space. Sub- 
stituting E = —grad ¢, U can be changed to the following form: 


l 1 
-t IE TEL e [aiv o) dV + s; | div Ear. 
8x 8x 8x 


According to Gauss' theorem, the first integral is equal to the integral of Eó over the surface 
bounding the volume of integration, but since the integral is taken over all space and sincé 
the field is zero at infinity, this integral vanishes. Substituting in the second integral, 
div E = 4zg, we find the following expression for the energy of a system of charges: 

U =} [ o dV. (37.1) 


For a system of point charges, e,, we can write in place of the integral a sum over the 
charges 
U -1 6,0, (37.2) 
a 


where @, is the potential of the field produced by all the charges, at the point where the 
charge e, is located. 
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If we apply our formula to a single elementary charged particle (say, an electron), and the 
field which the charge itself produces, we arrive at the result that the charge must have a 
certain "self "-potential energy equal to e@/2, where ¢ is the potential of the field produced 
by the charge at the point where it is located. But we know that in the theory of relativity 
every elementary particle must be considered as pointlike. The potential $ — e/R of its field 
becomes infinite at the point R = 0. Thus according to electrodynamics, the electron would 
have to have an infinite “self-energy”, and consequently also an infinite mass. The physical 
absurdity of this result shows that the basic principles of electrodynamics itself lead to the 
result that its application must be restricted to definite limits. 

We note that in view of the infinity obtained from electrodynamics for the self-energy and 
mass, it is impossible within the framework of classical electrodynamics itself to pose the 
question whether the total mass of the electron is electrodynamic (that is, associated with the 
electromagnetic self-energy of the particle). 

Since the occurrence of the physically meaningless infinite self-energy of the elementary 
particle is related to the fact that such a particle must be considered as pointlike, we can 
conclude that electrodynamics as a logically closed physical theory presents internal con- 
tradictions when we go to sufficiently small distances. We can pose the question as to the 
order of magnitude of such distances. We can answer this question by noting that for the 
electromagnetic self-energy of the electron we should obtain a value of the order of the rest 
energy mc?. If, on the other hand, we consider an electron as possessing a certain radius Ro, 
then its self-potential energy would be of order e?/R,. From the requirement that these two 


quantities be of the same order, e?/Ry ~ mc?, we find 


e? 


Ro~ mar’ (37.3) 

This dimension (the "radius" of the electron) determines the limit of applicability of 

electrodynamics to the electron, and follows already from its fundamental principles. We 

must, however, keep in mind that actually the limits of applicability of the classical electro- 

dynamics which is presented here lie much higher, because of the occurrence of quantum 
phenomena.1 

We now turn again to formula (37.2). The potentials $, which appear there are equal, from 


Coulomb's law, to 
e 
b= Y V. (37.4) 
ab 


where Ras is the distance between the charges e,, ej. The expression for the energy (37.2) 
consists of two parts. First, it contains an infinite constant, the self-energy of the charges, not 
depending on their mutual separations. The second part is the energy of interaction of the 
charges, depending on their separations. Only this part has physical interest. It is equal to 


U'=4} epo (37.5) 
where 
" €, 
=> R. (37.6) 
b*a Ngp 


f From the purely formal point of view, the finiteness of the electron mass can be handled by introducing 
an infinite negative mass of nonelectromagnetic origin which compensates the infinity of the electromagnetic 
mass (mass "renormalization"). However, we shall see later (§ 75) that this does not eliminate all the internal 
contradictions of classical electrodynamics. 

1 Quantum effects become important for distances of the order of h/mc, where h is Planck's constant. The ratio 
of these distances to R, is of order Ac/e? ~ 137. 
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is the potential at the point of location of e,, produced by all the charges other than e,. In 
other words, we can write 


1 e,€, 
U'z- =, 37.7 
2 2. Ra ( ) 
In particular, the energy of interaction of two charges is 
eie 
U'z . (37.8) 
Ri; 


$ 38. The field of a uniformly moving charge 


We determine the field produced by a charge e, moving uniformly with velocity V. We 
call the laboratory frame the system K; the system of reference moving with the charge is the 
K' system. Let the charge be located at the origin of coordinates of the K' system. The 
system K’ moves relative to K along the X axis; the axes Y and Z are parallel to Y' and Z’. 
At the time f = 0 the origins of the two systems coincide. The coordinates of the charge in 
the K system are consequently x = Vt, y= z —0. In the K’ system, we have a constant 
electric field with vector potential A’ = 0, and scalar potential equal to @’ = e/ R', where 
R? = x’? +y? +z”. In the K system, according to (24.1) for A’ = 0, 

f e 
PE 


y y? 
is Rifts 
"| c? c? 


We must now express R’ in terms of the coordinates x, y, z, in the K system. According to 
the formulas for the Lorentz transformation 


(38.1) 


x—Vt 


from which 
2 


(x— Vt)? + (1- =) (y? +27) 


R? = (38.2) 
y? 
{= 
c? 
Substituting this in (38.1) we find 
e 
¢= R (38.3) 
where we have introduced the notation 
y? 
R9 = (x—V1)? + (1- =) (y? 4 z?). (38.4) 


The vector potential in the K system is equal to 


V ev 
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In the K’ system the magnetic field H’ is absent and the electric field is 


From formula (24.2), we find 
pes a E,= NE. e E = 
ERNE 
c? c? 
ez' 


Substituting for R’, x’, y’, z', their expressions in terms of x, y, z, we obtain 


V?v eR 
E- ( - =) Le (38.6) 


where R is the radius vector from the charge e to the field point with coordinates x, y, z (its 
components are x— Vt, y, 2). 

This expression for E can be written in another form by introducing the angle 0 between 
the direction of motion and the radius vector R. It is clear that y? -- 2? = R? sin? 0, and there- 
fore R*? can be written in the form: 


V? 
R*? = R? (1 uec sin? e) (38.7) 
Then we have for E, 
V? 
j- EN 
eR c? 
E- R? y? 3/2 (38.8) 
( Tc sin? ) 
c 


For a fixed distance R from the charge, the value of the field E increases as Ó increases 
from 0 to 2/2 (or as @ decreases from z to 2/2). The field along the direction of motion 
((8 = 0, x) has the smallest value; it is equal to 


e y? 


The largest field is that perpendicular to the velocity (0 = 1/2), equal to 


1 
E, — 


l ue je 

c? 
We note that as the velocity increases, the field E, decreases, while E, increases. We can 
describe this pictorially by saying that the electric field of a moving charge is “contracted” 
in the direction of motion. For velocities V close to the velocity of light, the denominator 


in formula (38.8) is close to zero in a narrow interval of values @ around the value 0 = z/2. 
The “width” of this interval is, in order of magnitude, 


72 
ao~ i-i 
c 
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Thus the electric field of a rapidly moving charge at a given distance from it is large only ina 
narrow range of angles in the neighborhood of the equatorial plane, and the width of this 
interval decreases with increasing V like /1—(V ?/c?). 

The magnetic field in the K system is 


1 
H--VxE (38.9) 
c 


[see (24.5)]. In particular, for V <c the electric field is given approximately by the usual 
formula for the Coulomb law, E = eR/R?, and the magnetic field is 
eVxR 
H = 


= oR (38.10) 


PROBLEM 


Determine the force (in the K system) between two charges moving with the same velocity V. 
Solution: We shall determine the force F by computing the force acting on one of the charges (ei) 
in the field produced by the other (e2). Using (38.9), we have 
2 
F =e, E24 E V«H, - e, ( - =) E,- 2 W(V Ej. 
[4 c7 c 


Substituting for E; from (38.8). we get for the components of the force in the direction of motion 
(F,) and perpendicular to it (F,): 


V? y2 2 
UL (1-5) sin 0 
€1€2 u ei ez c 
F- R? y? 3/2? F, 7. r0  — 8ja 
(-z sin? ) 


where R is the radius vector from es to ei, and @ is the angle between R and V. 


§ 39. Motion in the Coulomb field 


We consider the motion of a particle with mass m and charge e in the field produced by a 
second charge e'; we assume that the mass of this second charge is so large that it can be 
considered as fixed. Then our problem becomes the study of the motion of a charge e in a 
centrally symmetric electric field with potential @ = e’/r. 

The total energy & of the particle is equal to 


———;5 4 
E =e p Hmh, 
" 


where a = ee’. If we use polar coordinates in the plane of motion of the particle, then as we 
know from mechanics, 

p? = Qr) py, 
where p, is the radial component of the momentum, and M is the constant angular momen- 
tum of the particle. Then 


, M* ,, a 
eoe pt M eie (39.1) 
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We discuss the question whether the particle during its motion can approach arbitrarily 
close to the center. First of all, it is clear that this is never possible if the charges e and e’ 
repel each other, that is, if e and e’ have the same sign. Furthermore, in the case of attraction 
(e and e’ of opposite sign), arbitrarily close approach to the center is not possible if Mc > |al, 
for in this case the first term in (39.1) is always larger than the second, and for r — O the 
right side of the equation would approach infinity. On the other hand, if Mc < |a], then 
as r — 0, this expression can remain finite (here it is understood that p, approaches infinity). 
Thus, if 
cM < |a], (39.2) 

the particle during its motion "falls in" toward the charge attracting it, in contrast to non- 
relativistic mechanics, where for the Coulomb field such a collapse is generally impossible 
(with the exception of the one case M = 0, where the particle e moves on a line toward the 
particle e^). 

A complete determination of the motion of a charge in a Coulomb field starts most 
conveniently from the Hamilton-Jacobi equation. We choose polar coordinates r, $, in 
the plane of the motion. The Hamilton-Jacobi equation (16.11) has the form 


1/0S aM OSA? . 1/0S8V 35 
Gt) «(2) * 5 (55) Tm'c*z0. 
We seek an S of the form 
S= —6t- Mócf(r), 


where & and M are the constant energy and angular momentum of the moving particle. The 


result is 
M? 
S=—-€t+M¢+ | Ji (e-*) - M. mee («- -f cocum ?c?dr. (39.3) 


The trajectory is determined by the equation 0S/0M = const. Integration of (39.3) leads to 
the following results for the trajectory: 


(a) If Mc > la], a 
(M? ~-a?) - = c (MEY — m?c?(M 2c? — a?) cos (s Vi ae 
(b) If Mc < lal, 


2 


i ESPERE 
(0? — Mc?) DT E cV (MEY + m?c?(a? — M?c?) cosh (s ee 2T 


EST 1) +8a, (39.5) 


(c) If Mc = la], : 
28a 224. 42 (= 
= &*—m’*c"—¢ m) (39.6) 
The integration constant is contained in the arbitrary choice of the reference line for 
measurement of the angle ¢. 

In (39.4) the ambiguity of sign in front of the square root is unimportant, since it already 
contains the arbitrary reference origin of the angle $ under the cos. In the case of attraction 
(a < 0) the trajectory corresponding to this equation lies entirely at finite values of r (finite 
motion), if £ < mc’. If £ > mc?, then r can go to infinity (infinite motion). The finite motion 
corresponds to motion in a closed orbit (ellipse) in nonrelativistic mechanics. From (39.4) 
it is clear that in relativistic mechanics the trajectory can never be closed; when the angle $ 
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changes by 27, the distance r from the center does not return to its initial value. In place of 
ellipses we here get orbits in the form of open “rosettes”. Thus, whereas in nonrelativistic 
mechanics the finite motion in a Coulomb field leads to a closed orbit, in relativistic 
mechanics the Coulomb field loses this property. 

In (39.5) we must choose the positive sign for the root in case « < 0, and the negative sign 
if « > 0 [the opposite choice of sign would correspond to a reversal of the sign of the root in 
(39.1)]. 

For a < 0 the trajectories (39.5) and (39.6) are spirals in which the distance r approaches 
Oas $ — oo. The time required for the "falling in" of the charge to the coordinate origin is 
finite. This can be verified by noting that the dependence of the coordinate r on the time is 
determined by the equation S/O = const; substituting (39.3), we see that the time is 
determined by an integral which converges for r — 0. 


PROBLEMS 


1. Determine the angle of deflection of a charge passing through a repulsive Coulomb field 
(a >0). 

Solution: The angle of deflection y equals y = x — 2, where 24, is the angle between -the two 
asymptotes of the trajectory (39.4). We find 


2cM n ( V cM? — =) 
tan“! { ——————— |, 


SS SS 
V c?M?—a? ca 


where v is the velocity of the charge at infinity. 

2. Determine the effective scattering cross section at small angles for the scattering of particles 
in a Coulomb field. 

Solution: The effective cross section do is the ratio of the number of particles scattered per second 
into a given element do of solid angle to the flux density of impinging particles (i.e., to the number 
of particles crossing one square centimeter, per second, of a surface perpendicular to the beam of 
particles). 

Since the angle of deflection z of the particle during its passage through the field is determined 
by the impact parameter ọ (i.e. the distance from the center to the line along which the particle 
would move in the absence of the field), 


X= 


da = 2nede = one 2 di = de ae 


d. T dx sin x’ 

where do = 2z sin xdx.t The angle of deflection (for small angles) can be taken equal to the ratio 
of the change in momentum to its initial value. The change in momentum is equal to the time integral 
of the force acting on the charge, in the direction perpendicular to the direction of motion; it is 
approximately (a/r?):(p/r). Thus we have 


+x 
N ag dt _ 2a 
r^p (Q?-- vr? pov 


-x 


(v is the velocity of the particles). From this we find the effective cross section for small x: 


2 
do —4 (5) ae. 
po) x 


In the nonrelativistic case, p = mv, and the expression coincides with the one obtained from the 
Rutherford formula} for small x. 


t See Mechanics, § 18. 
1 See Mechanics, $ 19. 
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§ 40. The dipole moment 


We consider the field produced by a system of charges at large distances, that is, at 
distances large compared with the dimensions of the system. 

We introduce a coordinate system with origin anywhere within the system of charges. 
Let the radius vectors of the various charges be r,. The potential of the field produced by 
all the charges at the point having the radius vector Rg is 


€a 
p= 2 Rari (40.1) 


(the summation goes over all charges); here Rọ —r, are the radius vectors from the charges 
€a to the point where we are finding the potential. 
We must investigate this expansion for large Ro(Ro > r,). To do this, we expand it in 
powers of r,/Ro, using the formula 
f(Rg —r) = f(Ro) 7r : grad f(Ro) 
(in the grad, the differentiation applies to the coordinates of the vector Ro). To terms of first 


order, 
Ye 
Ro 


1 
o= Y, er. grad —. (40.2) 
Ro 


The sum 
d=) er, (40.3) 
is called the dipole moment of the system of charges. It is important to note that if the sum 
of all the charges, X e,, is zero, then the dipole moment does not depend on the choice of 


the origin of coordinates, for the radius vectors r, and r, of one and the same charge in two 
different coordinate systems are related by 


r, =T, tā, 
where a is some constant vector. Therefore if X e, = 0, the dipole moment is the same 


in both systems: 
d'= Y er, — Y esta tay e, — d. 
If we denote by e*, r? and e7, rj the positive and negative charges of the system and 
their radius vectors, then we can write the dipole moment in the form 


d-Yen-Yean-2Riyea-R.Le (40.4) 
Lets q-2Ahen 
X6 


are the radius vectors of the "charge centers" for the positive and negative charges. If 
Ley =} e; =e, then 


where 


(40.5) 


d=eR,., (40.6) 


where R,. = R*—R' is the radius vector from the center of negative to the center of 
positive charge. In particular, if we have altogether two charges, then R,- is the radius 
vector between them. 
If the total charge of the system is zero, then the potential of the field of this system at large 
distances is 
d- Ro 
RÈ ` 


pa -4v = (40.7) 
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The field intensity is: 


d E Ry 1 1 
. E = — grad URP -— R grad (d: Roọ)— (d: Ro) grad R? 
or finally, 
3(n-d)n—d 
E= Mnt (40.8) 
Ro 
where n is a unit vector along Ry. Another useful expression for the field is 
i 
E = (d- VV RD (40.9) 


0 


Thus the potential of the field at large distances produced by a system of charges with 
total charge equal to zero is inversely proportional to the square of the distance, and the 
field intensity is inversely proportional to the cube of the distance. This field has axial 
symmetry around the direction of d. In a plane passing through this direction (which we 
choose as the z axis), the components of the vector E are: 


3cos? 0—1 3 sin 0 cos 0 
= d —————, «= d ——á——. : 
E, —« Ri f Ri (40.10) 
The radial and tangential components in this plane are 
2 cos 0 sin 0 


8 41. Multipole moments 
In the expansion of the potential in powers of 1/Ro, 
$ 29944044904 ..., (41.1) 


the term $? is proportional to 1/R5* !. We saw that the first term, à, is determined by 
the sum of all the charges; the second term, $"'’, sometimes called the dipole potential of 
the system, is determined by the dipole moment of the system. 
The third term in the expansion is 
1 e I 
D = X eta a es 41.2 
¢ DX 0X,0X, (z.) ( ) 


where the sum goes over all charges; we here drop the index numbering the charges; x, are 
the components of the vector r, and X, those of the vector Ro. This part of the potential is 
usually called the guadrupole potential. If the sum of the charges and the dipole moment of 
the system are both equal to zero, the expansion begins with $C. 

In the expression (41.2) there enter the six quantities L ex, xz. However, it is easy to see 
that the field depends not on six independent quantities, but only on five. This follows from 
the fact that the function 1/Rp satisfies the Laplace equation, that is, 


1 o? I 
—)={6§6,——{—})=0 
S (x) 7 0X 0X, (z) 
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We can therefore write ¢° in the form 


1 1 e l 
Qu ~ = rsa) LZ . 
=Le (ss 3! «) 0X, OX, (x): 
The tensor 
Dap = Y, e(3x,x — 170,5) (41.3) 


is called the quadrupole moment of the system. From the definition of D,, it is clear that 
the sum of its diagonal elements is zero: 


Da = 0. (41.4) 
Therefore the symmetric tensor D,, has altogether five independent components. With the 
aid of D,,, we can write 
D a? l 
E A = 41.5 
? 6 0X,0X, (x) (41.5) 
or, performing the differentiation, 
e ] 3X,X; Sap 


and using the fact that 6,5 Dag = Daa = 0, 
gp = Dag nag 
2R? 


Like every symmetric three-dimensional tensor, the tensor D,, can be brought to principal 
axes. Because of (41.4), in general only two of the three principal values will be independent. 
If it happens that the system of charges is symmetric around some axis (the z axis)} then 
this axis must be one of the principal axes of the tensor D,,, the location of the other two 
axes in the x, y plane is arbitrary, and the three principal valucs are related to one another: 


D,, = D,, = —4D,,. (41.7) 
Denoting the component D,, by D (in this case it is simply called the quadrupole moment), 
we get for the potential 


D D 
o” = AR G cos? 0— 1) = 2RÀ P,(cos 0), (41.8) 
0 (0 


(41.6) 


where 0 is the angle between Ry and the z axis, and P, is a Legendre polynomial. 

Just as we did for the dipole moment in the preceding section, we can easily show that 
the quadrupole moment of a system does not depend on the choice of the coordinate origin, 
if both the total charge and the dipole moment of the system are equal to zero. 

In similar fashion we could also write the succeeding terms of the expansion (41.1). The 
I'th term of the expansion defines a tensor (which is called the tensor of the 2'-pole moment) 
of rank /, symmetric in all its indices and vanishing when contracted on any pair of indices; 
it can be shown that such a tensor has 2/+ 1 independent components. 

We shall express the general term in the expansion of the potential in another form, by 


using the well-known formula of the theory of spherical harmonics 
1 1 r 

LL, mo - y —— Pos y), (41.9) 

[R, —r| V R24 r? -2rRg COS X 170 Re! ! 


t We are assuming a symmetry axis of any order higher than the second. 
t Such a tensor is said to be irreducible. The vanishing on contraction means that no tensor of lower 
rank can be formed from the components. 
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where x is the angle between R, and r. We introduce the spherical angles ©, ® and 6, ¢, 
formed by the vectors Ro and r, respectively, with the fixed coordinate axes, and use the 
addition theorem for the spherical harmonics: 


L 
Pi(cos x)= X a ce; Pleas G)P|"l(cos 6)e -"t*-9, (41.10) 


where the P|" are the associated Legendre polynomials. 
We also introduce the spherical functions} 


21-- 1 (I — m)! 
Y,(0, 0) 2 (- 1)"i! | | — 
(6,4) = CD"! AT m) 
Y, M $)- (7-0 "Y^, 
Then the expansion (41.9) takes the form: 
1 e d. r dmn 
_ =) pitt 
[Ro r| 1=0 m=-! rRo 20/1 
Carrying out this expansion in each term of (40.1), we finally get the following expression 
for the /'th term of the expansion of the potential: 


Pricos 0)e"*, mz 0, 


(41.11) 


Y (O, 9) Y, (0, Q). 


] I 
n» NEU ®), (41.12) 
where 
4n 
Qm = D eara J Yin (Bar $a): (41.13) 


The set of 2/-- 1 quantities QU? form the 2'-pole moment of the system of charges 
The quantities Q(! defined in this way are related to the components of the dipole 
moment vector d by the formulas 


OP=id, QU- NE id,). (41.14) 


The quantities Q(? are related to the tensor components D,, by the relations 


1 
Qh--lp, QA = £- (Dat iD), 
2 v6 (41.15) 
QQ = ON e (Dxx— D,,y+2iD,y). 
PROBLEM 


Determine the quadrupole moment of a uniformly charged ellipsoid with respect to its center. 
Solution: Replacing the summation in (41.3) by an integration over the volume of the ellipsoid, 
we have: 


Dip f f f (2x?—y?—z?)dx dy dz, etc. 


Let us choose the coordinate axes along the axes of the ellipsoid with the origin at its center; from 
symmetry considerations it is obvious that these axes are the principal axes of the tensor Das. By 
means of the transformation 


, 


x=xa, y—yb, z zc 


1 In accordance with the definition used in quantum mechanics. 
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the integration over the volume of the ellipsoid 

x? y? z? 

at pt an! 
is reduced to integration over the volume of the unit sphere 

x24 y'24 7/2 = 1, 
As a result we obtain: 


D,-lQ&-P-8, yy = Ș (26?—-a?—c%), 
Daz = $ (2° —a?—b?), 


where e = (47/3)abce is the total charge of the ellipsoid. 


§ 42. System of charges in an external field 


We now consider a system of charges located in an external electric field. We designate 
the potential of this external field by ¢(r). The potential energy of each of the charges is 
€, d(r,), and the total potential energy of the system is , 


U =} e, (r). (42.1) 


We introduce another coordinate system with its origin anywhere within the system of 
charges; r, is the radius vector of the charge e, in these coordinates. 

Let us assume that the external field changes slowly over the region of the system of 
charges, i.e. is quasiuniform with respect to the system. Then we can expand the energy U 
in powers of r,: 

U =UV UDU +...; (42.2) 

in this expansion the first term is 
U = po Y en (42.3) 
where ġo is the value of the potential at the origin. In this approximation, the energy of the 
system is the same as it would be if all the charges were located at one point (the origin). 

The second term in the expansion is 


U'? = (grad $)o' Y, eats 


Introducing the field intensity E, at the origin and the dipole moment d of the system, we 
have 


uO. —d-Ep. (42.4) 
The total force acting on the system in the external quasiuniform field is, to the order we 
are considering, 


F= Eo} e, + [V(d-E)],. 
If the total charge is zero, the first term vanishes, and 


i.e. the force is determined by the derivatives of the field intensity (taken at the origin). The 
total moment of the forces acting on the system is 


K = Y (r, x e, Ej) = d x Eo, (42.6) 
i.e. To lowest order it is determined by the field intensity itself 
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Let us assume that there are two systems, each having total charge zero, and with dipole 
moments d, and d,, respectively. Their mutual distance is assumed to be large 1n comparison 
with their internal dimensions. Let us determine their potential energy of interaction, U. To 
do this we regard one of the systems as being in the field of the other. Then 

= —d, . E,. 
where E, is the field of the first system. Substituting (40.8) for E,, we find: 

(d, 4)? — 3(d, : RY(d;- R) 
= -—————-- RS - ; 
where R is the vector separation between the two systems. 


For the case where one of the systems has a total charge different from zero (and equal 
to e), we obtain similarly 


U 


(42.7) 


d-R 
U=e R’ (42.8) 
where R is the vector directed from the dipole to the charge. 
The next term in the expansion (42.1) is 


Here, as in § 41, we omit the index numbering the charge; the value of the second 


derivative of the potential is taken at the origin; but the potential $ satisfies Laplace’s 
equation, 


Therefore we can write 


or, finally, 


(2) 9B 7 TO 
U'^z 6 dx,éx, (42.9) 

The general term in the series (42.2) can be expressed in terms of the 2'-pole moments 
D! defined in the preceding section. To do this, we first expand the potential ó(r) in 
spherical harmonics; the general form of this expansion is 


z l 4n 
= 4 —— 
olr) ~ 2. ! PNE: 1 aim Yim (0, $). (42.10) 


where r, 0, $ are the spherical coordinates of a point and the a,,, are constants. Forming the 
sum (42.1) and using the definition (41.13), we obtain: 


I 
U= Y amQ. (42.11) 
1 


m=- 


§ 43. Constant magnetic field 


Let us consider the magnetic field produced by charges which perform a finite motion, in 
which the particles are always within a finite region of space and the momenta also always 
remain finite. Such a motion has a “stationary” character, and it is of interest to consider 
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the time average magnetic field H, produced by the charges; this field will now be a function 
only of the coordinates and not of the time, that is, it will be constant. 

In order to find equations for the average magnetic field H, we take the time average of the 
Maxwell equations 


10E 4 
div H = 0, curl H 2 - + j. 
c ôt c 
The first of these gives simply 
div H = 0. (43.1) 


In the second equation the average value of the derivative GE/dr, like the derivative of any 
quantity which varies over a finite range, is zero (cf. the footnote on p. 84). Therefore the 


second Maxwell equation becomes 


_ 4 
curl H = z j. (43.2) 


These two equations determine the constant field H. 
We introduce the average vector potential A in accordance with 
curl A = H. 


We substitute this in equation (43.2). We find 
— ,— 4 
grad div A- AA = - J. 
c 


But we know that the vector potential of a field is not uniquely defined, and we can impose 
an arbitrary auxiliary condition on it. On this basis, we choose the potential A so that 


div A = 0. (43.3) 
Then the equation defining the vector potential of the constant magnetic field becomes 
— 4 
AK=- <j, (43.4) 


It is easy to find the solution of this equation by noting that (43.4) is completely analogous 
to the Poisson equation (36.4) for the scalar potential of a constant electric field, where in 
place of the charge density o we here have the current density j/c. By analogy with the solution 
(36.8) of the Poisson equation, we can write 


_ fj 
=i | dav, (43.5) 


where R is the distance from the field point to the volume element dV. 

In formula (43.5) we can go over from the integral to a sum over the charges, by sub- 
stituting in place of j the product ov, and recalling that all the charges are pointlike. In this 
we must keep in mind that in the integral (43.5), R is simply an integration variable, and is 
therefore not subject to the averaging process. If we write in place of the integral 


f i dV, thesum Š, T, 


then R, here are the radius vectors of the various particles, which change during the motion 
of the charges. Therefore we must write - 
Ea a 


Lo 1 ey, 
A= È R,’ 


where we average the whole expression under the summation sign. 


(43.6) 
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Knowing A, we can also find the magnetic field, 


E = l 
H = curl A = curl - | Lav. 
cJ) R 


- 


The curl operator refers to the coordinates of the field point. Therefore the curl can be 
brought under the integral sign and j can be treated as constant in the differentiation. 
Applying the well known formula 


curl fa = f curl a + grad f x a, 


where f and a are an arbitrary scalar and vector, to the product j. 1/R, we get 


du besides xj ode 
R R R?’ 
and consequently, 
H=; [irav (43.7) 
cJ R? ' 


(the radius vector R is directed from dV to the field point). This is the /aw of Biot and 
Savart. 


§ 44. Magnetic moments 


Let us consider the average magnetic field produced by a system of charges in stationary 
motion, at large distances from the system. 

We introduce a coordinate system with its origin anywhere within the system of charges, 
just as we did in § 40. Again we denote the radius vectors of the various charges by r,, and 
the radius vector of the point at which we calculate the field by Ry. Then Ry —r, is the radius 
vector from the charge e, to the field point. According to (43.6), we have for the vector 
potential: 


AG y Ms (44.1) 


As in § 40, we expand this expression in powers of r,. To terms of first order (we omit the 
index a), we have 


EE YE i 
uu Le EIC Vir) 


In the first term we can write 


Ye-zYe. 


But the average value of the derivative of a quantity changing within a finite interval (like 
E er) is zero. Thus there remains for À the expression 


1 1 o—— 
(ova) gu 


We transform this expression as follows. Noting that v =f, we can write (remembering 


A 


li 
| 
C | 
mM 
D 
< 


§ 44 MAGNETIC MOMENTS 105 


that Ry is a constant vector) 
ld 1 
Y e(Ro rv = 53h Y err R9) z Y. e[v(r- R9) - r(v - Ro)]. 


Upon substitution of this expression in A, the average of the first term (containing the time 
derivative) again goes to zero, and we get 


1 
A= IR Y e[v(r- Rj) - r(v- R,)]. 
We introduce the vector 
1 

Hee Y erxv, (44.2) 
which is called the magnetic moment of the system. Then we get for A: 
= 3t. 
——— =V— xm (44.3) 
Knowing the vector potential, it is easy to find the magnetic field. With the aid of the 


formula 
curl (a x b) = (b: V)a — (a - V)b +a div b—b div a, 


we find 
H = curl A = curl ee = m div — S- (m: vi 
R3 R3 R3 
Furthermore, 
. Ro 1 p^ 
AN z Rovere patus div Bo =0 

and 

m. vy Re mo 3RQmCR, 

(MV) gs R go V)Ro+Ro(im- Vn7m- UE 
Thus, 


3n(m-n)—-m 


44.4 
RB (44.4) 


where n is again the unit vector along Ry. We see that the magnetic field is expressed in terms 
of the magnetic moment by the same formula by which the electric field was expressed in 
terms of the dipole moment [see (40.8)]. 

If all the charges of the system have the same ratio of charge to mass, then we can write 


mas Dery v=5— ym. 


If the velocities of all the charges v < c, then mv is the momentum p of the charge and we 
get 


(44.5) 


mg rx 


where M = Xrxp is the mechanical angular momentum of the system. Thus in this case, 
the ratio of magnetic moment to the angular momentum is constant and equal to e/2mc. 


, 


= Ine 
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PROBLEM 


Find the ratio of the magnetic moment to the angular momentum for a system of two charges 
(velocities v «c c). 

Solution: Choosing the origin of coordinates as the center of mass of the two particles we have 
mj ti mgr; = 0 and p; = —ps =p, where p is the momentum of the relative motion. With the 


aid of these relations, we find 
1 fe e mm 
maa (24 f) mms, 
2¢\ mi mj mtm 


§ 45. Larmor’s theorem 


Let us consider a system of charges in an external constant uniform magnetic field. 
The time average of the force acting on the system, 


Fay fixH= Sy fexn, 


is zero, as is the time average of the time derivative of any quantity which varies over a 
finite range. The average value of the moment of the forces is 


K =) 5 @xwxH) 


and is different from zero. It can be expressed in terms of the magnetic moment of the system, 


by expanding the vector triple product: 
e e 1. d 
=)- :H)- g-5- :H)-  H — E? j. 
K= s {v(r-H)—H(v-r)} Lz ie ) Hr j 


The second term gives zero after averaging, so that 
——— | — LL 
K = y= vw) = > L e H) -rv H) 
c 


[the last transformation is analogous to the one used in deriving (44.3)], or finally 
K — m xH. (45.1) 
We call attention to the analogy with formula (42.6) for the electrical case. 


The Lagrangian for a system of charges in an external constant uniform magnetic field 
contains (compared with the Lagrangian for a closed system) the additional term 


e e e 
L = - A: = — "y= — . . 
H Y- v Y Hry Y x) H (45.2) 
{where we have used the expression (19.4) for the vector potential of a uniform field]. 
Introducing the magnetic moment of the system, we have: 
Ly = m-H. (45.3) 


We call attention to the analogy with the electric field; in a uniform electric field, the 
Lagrangian of a system of charges with total charge zero contains the term 


Lg = d . E, 
which in that case is the negative of the potential energy of the charge system (see § 42). 
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We now consider a system of charges performing a finite motion (with velocities v « c) 
in the centrally symmetric electric field produced by a certain fixed charge. We transform 
from the laboratory coordinate system to a system rotating uniformly around an axis 
passing through the fixed particle. From the well-known formula, the velocity v ofthe particle 
in the new coordinate system is related to its velocity v' in the old system bv the relation 


v =v+QOxr, 


where r is the radius vector of the particle and Q is the angular velocity of the rotating co- 
ordinate system. In the fixed system the Lagrangian of the system of charges is 


mv"? 


L-Y--U. 


where U is the potential energy of the charges in the external field plus the energy of their 
mutual interactions. The quantity U is a function of the distances of the charges from the 
fixed particle and of their mutual separations; when transformed to the rotating system it 
obviously remains unchanged. Therefore in the new system the Lagrangian is 


m 
Le 3 (vq xp'-U. 
Let us assume that all the charges have the same charge-to-mass ratio e/m, and set 


Q= H. (45.4) 
2mc 


Then for sufficiently small H (when we can neglect terms in H?) the Lagrangian becomes: 


mo? | 
Lex tae et v— U. 


We see that it coincides with the Lagrangian which would have described the moticn of the 
charges in the laboratory system of coordinates in the presence of a constant magnetic field 
(see (45.2)). 

Thus we arrive at the result that, in the nonrelativistic case, the behavior of a system of 
charges all having the same e/m, performing a finite motion in a centrally symmetric electric 
field and in a weak uniform magnetic field H, is equivalent to the behavior of the same system 
of charges in the same electric field in a coordinate system rotating uniformly with the angular 
velocity (45.4). This assertion is the content of the Larmor theorem, and the angular velocity 
Q = eH/2mc is called the Larmor frequency. 

We can approach this same problem from a different point of view. If the magnetic field 
H is sufficiently weak, the Larmor frequency will be small compared to the frequencies of the 
finite motion of the system of charges. Then we may consider the averages, over times small 
compared to the period 2z/Q, of quantities describing the system. These new quantities will 
vary slowly in time (with frequency Q). 

Let us consider the change in the average angular momentum M of the system. According 
to a well-known equation of mechanics, the derivative of M is equal to the moment K of 
the forces acting on the system. We therefore have, using (45.1): 

dM 


If the e/m ratio is the same for all particles of the system, the angular momentum and 
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magnetic moment are proportional to one another, and we find by using formulas (44.5) 
and (45.4): 
dM 


a = 79M. (45.5) 


This equation states that the vector M (and with it the magnetic moment iii) rotates with 
angular velocity — Q around the direction of the field, while its absolute magnitude and the 
angle which it makes with this direction remain fixed. (This motion is called the Larmor 
precession.) 


CHAPTER 6 


ELECTROMAGNETIC WAVES 


§ 46. The wave equation 


The electromagnetic field in vacuum is determined by the Maxwell equations in which we 
must set p = 0, j = 0. We write them once more: 


curl E= -z> divH=0, (46.1) 


div E = 0. (46.2) 


These equations possess nonzero solutions. This means that an electromagnetic field can 
exist even in the absence of any charges. 

Electromagnetic fields occurring in vacuum in the absence of charges are called electro- 
magnetic waves. We now take up the study of the properties of such waves. 

First of all we note that such fields must necessarily be time-varying. In fact, in the 
contrary case, 0H/dt = ðE/ðt = 0 and the equations (46.1) and (46.2) go over into the 
equations (36.1), (36.2) and (43.1), (43.2) of a constant field in which, however, we now 
have p = 0, j = 0. But the solution of these equations which is given by formulas (36.8) and 
(43.5) becomes zero for p = 0, j = 0. 

We derive the equations determining the potentials of electromagnetic waves. 

As we already know, because of the ambiguity in the potentials we can always subject 
them to an auxiliary condition. For this reason, we choose the potentials of the electro- 
magnetic wave so that the scalar potential is zero: 


o=0. (46.3) 
Then 
1 0A 
E--—-—, H = curl A. (46.4) 
c ôt 
Substituting these two expressions in the first of equations (46.2), we get 
167A 
curl curl A= — AA + grad div A = — (46.5) 


c 
Despite the fact that we have already imposed one auxiliary condition on the potentials, 
the potential A is still not completely unique. Namely, we can add to it the gradient of an 
arbitrary function which does not depend on the time (meantime leaving ọ unchanged). 
In particular, we can choose the potentials of the electromagnetic wave so that 
div A =0. (46.6) 
109 
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In fact, substituting for E from (46.4) in div E = 0, we have 


div t == diva =0, 

that is, div A is a function only of the coordinates. This function can always be made zero 
by adding to A the gradient of a suitable time-independent function. 

The equation (46.5) now becomes : 

1 0*A 

SMS c? ôt? 
This is the equation which determines the potentials of electromagnetic waves. It is called 
the d' Alembert equation, or the wave equation. 

Applying to (46.7) the operators curl and 0/0t, we can verify that the electric and magnetic 
fields E and H satisfy the same wave equation. 

We repeat the derivation of the wave equation in four-dimensional form. We write the 
second pair of Maxwell equations for the field in the absence of charges in the form 


= 0. (46.7) 


ir 
ôx“ 
(This is equation (30.2) with j' = 0.) Substituting F'*, expressed in terms of the potentials, 
a 6A OA! 
"Ox Ox, 
we get 
^A. PA 


0. (46.8) 


Gx;Ox* Ox, dx" 
We impose on the potentials the auxiliary condition: 
ðA" 
exi 
(This condition is called the Lorentz condition, and potentials that satisfy it are said to be in 
the Lorentz gauge.) Then the first term in (46.8) drops out and there remains 
A _ u GA! 


=0. (46.9) 


= = 46.10 
PETET, (A60) 
This is the wave equation expressed in four-dimensional form.1 
In three-dimensional form, the condition (46.9) is: 
106 
-— +div A =0. 46. 
c ôt TEM Gorn 


It is more general than the conditions $ = 0 and div A = 0 that were used earlier; potentials 
that satisfy those conditions also satisfy (46.11). But unlike them the Lorentz condition has 
a relativistically invariant character: potentials satisfying it in one frame satisfy it in any 
other frame (whereas condition (46.6) is generally violated if the frame is changed). 


+ The wave equation is sometimes written in the form (JA = 0, where 


is called the d’Alembertian operator. 


$ It should be mentioned that the condition (46.9) still does not determine the choice of the potentials 
uniquely. We can add to A a term grad f, and subtract a term I/c (cf/ct) from ¢, where the function f is 
not arbitrary but must satisfy the wave equation D f — 0. 
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$ 47. Plane waves 
We consider the special case of electromagnetic waves in which the field depends only on 


one coordinate, say x (and on the time). Such waves are said to be plane. In this case the 
equation for the field becomes 


de a em: (47.1) 


where by fis understood any component of the vectors E or H. 
To solve this equation, we rewrite it in the form 


Q Q fa] Ó 
(-*3:) (atea) 


x x 
$-t—-, n-trt- 
[4 [4 


so that 1 =4(n + E), x -50 — č). Then 


and introduce new variables 


0 1/2 0 0 1/2 ð 
ae aralan) 
so that the equation for f becomes 
af E 
den 
The solution obviously has the form f = /,(€)+/2(y), where f, and f; are arbitrary functions. 
Thus 
fe. (:- *) +h c J (47.2) 


Suppose, for example, f; = 0, so that 


1-4 (-3). 


Let us clarify the meaning of this solution. In each plane x = const, the field changes with 
the time; at each given moment the field is different for different x. It is clear that the field 
has the same values for coordinates x and times t which satisfy the relation t — (x/c) = const, 
that is, 

x = const 4- ct. 


This means that if, at some time t = 0, the field at a certain point x in space had some 

definite value, then after an interval of time t the field has that same value at a distance ct 

along the X axis from the original place. We can say that all the values of the electromagnetic 

field are propagated in space along the X axis with a velocity equal to the velocity of light, c. 
Thus, 


represents a plane wave moving in the positive direction along the X axis. It is easy to show 


that 
x 
PAN cvs 


represents a wave moving in the opposite, negative, direction along the X axis. 
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In § 46 we showed that the potentials of the electromagnetic wave can be chosen so that 
$ = 0, and div A = 0. We choose the potentials of the plane wave which we are now con- 
sidering in this same way. The condition div A = 0 gives in this case 


Ox 


since all quantities are independent of y and z. According to (47.1) we then have also 
07 A Jt? = 0, that is, 04,/0t = const. But the derivative QA/0t determines the electric field, 
and we see that the nonzero component A, represents in this case the presence of a constant 
longitudinal electric field. Since such a field has no relation to the electromagnetic wave, we 
can set A, = 0. 

Thus the vector potential of the plane wave can always be chosen perpendicular to the X 
axis, 1.e. to the direction of propagation of that wave. 

We consider a plane wave moving in the positive direction of the X axis; in this wave, all 
quantities, in particular also A, are functions only of t—(x/c). From the formulas 


E2--—, H = curl A, 
c ô 
we therefore obtain 
1 1 
B= — 2A He vxA e V(r- 2) xa e c nx (47.3) 


where the prime denotes differentiation with respect to t—(x/c) and n is a unit vector along 
the direction of propagation of the wave. Substituting the first equation in the second, we 
obtain 

H=nxE. (47.4) 


We see that the electric and magnetic fields E and H of a plane wave are directed perpen- 
dicular to the direction of propagation of the wave. For this reason, electromagnetic waves 
are said to be transverse. From (47.4) it is clear also that the electric and magnetic fields of 
the plane wave are perpendicular to each other and equal to each other in absolute value. 

The energy flux in the plane wave, i.e. its Poynting vector is 


=- ExH = 7 Ex(nxE), 


and since E-n = 0, 
S-—— E'n- < Hm. 
4n T 


Thus the energy flux is directed along the direction of propagation of the wave. Since 
1 E? 

W = — (E? + H°) = — 

an | TH) 4n 


is the energy density of the wave, we can write 
S = cWn, (47.5) 
in accordance with the fact that the field propagates with the velocity of light. 
The momentum per unit volume of the electromagnetic field is S/c?. For a plane wave this 
gives (W/c)n. We call attention to the fact that the relation between energy W and momen- 


tum W/c for the electromagnetic wave is the same as for a particle moving with the velocity 
of light [see (9.9)]. 
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The flux of momentum of the field is determined by the components o,, of the Maxwell 
stress tensor (33.3). Choosing the direction of propagation of the wave as the Y axis, we 
find that the only nonzero component of 7% is 


T" = —o,,=W. (47.6) 
As it must be, the flux of momentum is along the direction of propagation of the wave, and 
is equal in magnitude to the energy density. 
Let us find the law of transformation of the energy density of a plane electromagnetic 
wave when we change from one inertial reference system to another. To do this we start 
from the formula 


l vV, ve 
w-— (meme) 
)1—— 


c? 


(see the problem in $ 33) and must substitute 

S, 2 cW' cosa',0,, = —W'cos?a', 
where a' is the angle (in the K’ system) between the X’ axis (along which the velocity V 
is directed) and the direction of propagation of the wave. We find: 


V 2 
c = os a’ 
i E 


W-W LL (47.7) 


Since W = E?/4x = H ?/An, the absolute values of the field intensities in the wave. trans- 
form like y W. 


PROBLEMS 


1, Determine the force exerted on a wall from which an incident plane electromagnetic wave is 
reflected (with reflection coefficient R). 
Solution: The force f acting on unit area of the wall is given by the flux of momentum through 
this area, i.e., it is the vector with components 
fam Op Ng — Cag Ns, 
where N is the vector normal to the surface of the wall, and oq and oJ, are the components of the 
energy-momentum tensors for the incident and reflected waves. Using (47.6), we obtain: 
f= Wa(N -n)-+ W'n'(N n^. 
From the definition of the reflection coefficient, we have: W’ = RW. Also introducing the angle 
of incidence 0 (which is equal to the reflection angle) and writing out components, we find the 
normal force ("light pressure") 
fu = W(-- R) cos? 8 
and the tangential force 
f. = W(1—R)sin 8 cos 0. 


2. Use the Hamilton-Jacobi method to find the motion of a charge in the field of a plane electro- 
magnetic wave with vector potential Afr — (x/c)]. 
Solution: We write the Hamilton-Jacobi equation in four-dimensional form: 


è e 
e (Ee ca) (Get in) me o 
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The fact that the field is a plane wave means that the A‘ are functions of one independent variable, 
which can be written in the form & = kf xt, where k' is aconstant four-vector with its square equal 
to zero, k, k! = 0 (see the following section). We subject the potentials to the Lorentz condition 


04! dA! 

dx! dt 

for the variables field this is equivalent to the condition A'k, = 0. 
We seek a solution of equation (1) in the form 


S= —fixt FQ, 


where f'—(/9,f) is a constant vector satisfying the condition fi f! = nm?c? (S = —f,x' is the 
solution of the Hamilton-Jacobi equation for a free particle with four-momentum p! = f). Sub- 
stitution in (1) gives the equation 


k, — 0; 


where the constant y = k, ft. Having determined F from this equation, we get 
=f 5 [aate Es È aua. Q) 
cy 2yc? 


Changing to three-dimensional notation with a fixed reference frame, we choose the direction of 
propagation of the wave as the x axis. Then £ = ct—x, while the constant y = f°—f+. Denoting 
the two-dimensional vector fy, f; by x, we find from the condition f, f! = (f°)? —(f?)? —x? = m?c?, 

mic?4- x? 
fot =, 
y 
We choose the potentials in the gauge in which ¢ = 0, while A(¢) lies in the yz plane. Then equation 
(2) takes the form: 
4 mee e? 
Sswer tte ta) a E 4 — [x Adt — —3 | Ad. 
2 2y cy 2yc 

According to the general rules (Mechanics, $ 47), to determine the motion we must equate the 
derivatives 0S/ók, 0S/0y to certain new constants, which can be made to vanish by a suitable choice 
of the coordinate and time origins. We thus obtain the parametric equations in £: 


1 e 1 e 
»-istC-E f aae, impe | ntn 
1 [me x? e e? 2 — 
(RT -1) a [x Adet sers | nca ct — dd x. 


The generalized momentum P = p+(e/c)A and the energy 4 are found by differentiating the 
action with respect to the coordinates and the time; this gives: 


x 


e e 
Py = Ky 7 z Ay Pz = Ka 0s 
y,mc-ctgs e e 
=— > — - k: A4 — A’, 
Pr 3 2y c ub 
4 = (y+ pic. 


If we average these over the time, the terms of first degree in the periodic function A(Z) vanish. 
We assume that the reference system has been chosen so that the particle is at rest init on the average, 
i.e. so that its averaged momentum is zero. Then 


K —0, y 


§ 48 MONOCHROMATIC PLANE WAVES 115 


The final formulas for determining the motion have the form: 
xe [ra »--t[.« 2=-£[ Ade 
22c , y= cy yes, 3 RT zy, 


e — 
ct — 6&4 J l (A?—A?)dé; (3) 


e? — e e 
P: = IË (A?—A?), Pym có Áy, Pz = — z Az 


2 coe 
8 = cyt 5 (AT-AD. (4) 


§ 48. Monochromatic plane waves 


A very important special case of electromagnetic waves is a wave in which the field is a 
simply periodic function of the time. Such a wave is said to be monochromatic. All quantities 
(potentials, field components) in a monochromatic wave depend on the time through a 
factor of the form cos (wf+«). The quantity o is called the cyclic frequency of the wave (we 
shall simply call it the frequency). 

In the wave equation, the second derivative of the field with respect to the time is now 
a*f/at? = —w*f, so that the distribution of the field in space is determined for a mono- 
chromatic wave by the equation 


2 
Af = f=0. (48.1) 


In a plane wave (propagating along the x axis), the field is a function only of s—(x/c). 
Therefore, if the plane wave is monochromatic, its field is a simply periodic function of 
t—(x/c). The vector potential of such a wave is most conveniently written as the real part of 


a complex expression: 
— iol =x) 
A - Rela, e 7679) (48.2) 
Here Ag is a certain constant complex vector. Obviously, the fields E and H of such a 
wave have analogous forms with the same frequency c. The quantity 


2 
ja (48.3) 
[o 
is called the wavelength; it is the period of variation of the field with the coordinate x at a 
fixed time t. 
'The vector 
k-7n (48.4) 
c 
(where n is a unit vector along the direction of propagation of the wave) is called the wave 
vector. In terms of it we can write (48.2) in the form 


A = Re(Ag e 1790), (48.5) 


which is independent of the choice of coordinate axes. The quantity which appears multiplied 
by i in the exponent is called the phase of the wave. 
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So long as we perform only linear operations, we can omit the sign Re for taking the real 

part, and operate with complex quantities as such.f Thus, substituting 
A = Agel re) 
in (47.3), we find the relation between the intensities and the vector potential of a plane 
monochromatic wave in the form 
E = ikA, H- ikxA. (48.6) 

We now treat in more detail the direction of the field of a monochromatic wave. To be 

specific, we shall talk of the electric field 
E = RefE, 2179) 

(everything stated below applies equally well, of course, to the magnetic field). The quantity 


E, is a certain complex vector. Its square E? is (in general) a complex number. If the argument 
of this number is — 2a (i.e. Ej = |Eo|e 7^), the vector b defined by 


E, = be ^ (48.7) 
will have its square real, b? = |Eo|?. With this definition, we write: 
E = Re[be/& r7 er72, (48.8) 
We write b in the form 
b = b, + ib;, 


where b, and b, are real vectors. Since b? = b2—b2--2ib,: b, must be a real quantity, 
b, -b, = 0, i.e. the vectors b, and b, are mutually perpendicular. We choose the direction 
of b, as the y axis (and the x axis along the direction of propagation of the wave). We then 
have from (48.8): 
E, = b, cos (wt—k-r+a), 
E, = +b, sin (wt-k-r+a), 
where we use the plus (minus) sign if b, is along the positive (negative) z axis. From (48.9) it 
follows that 


(48.9) 


i 


E] E 

pp 

Thus we see that, at each point in space, the electric field vector rotates in a plane perpen- 

dicular to the direction of propagation of the wave, while its endpoint describes the ellipse 

(48.10). Such a wave is said to be elliptically polarized. The rotation occurs in the direction 

of (opposite to) a right-hand screw rotating along the x axis, if we have the plus (minus) sign 
in (48.9). 

If b, = by, the ellipse (48.10) reduces to a circle, i.e. the vector E rotates while remaining 

constant in magnitude. In this case we say that the wave is circularly polarized. The choice 

of the directions of the y and z axes is now obviously arbitrary. We note that in such a wave 


=1, . (48.10) 


t If two quantities A(z) and B(1) are written in complex form 
A(t) = Age7™4, Bit) = Boe", 
then in forming their product we must first, of course, separate out the real part. But if, as it frequently 
happens, we are interested only in the time average of this product, it can be computed as 
4 Re {A-B*). 
In fact, we have: i 
Re A: ReB = j(Aye ^" + Af e") (Bue ^" + Br e“). 
When we average, the terms containing factors e*? vanish, so that we are left with 


Re A- ReB = (A, B! + A*-B,) = | Re(A-B*). 
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the ratio of the y and z components of the complex amplitude E, is 


Eo: ] 
E;, ti (48.11) 
for rotation in the same (opposite) direction as that of a right-hand screw (right and left 
polarizations). + 

Finally, if b, or b, equals zero, the field of the wave is everywhere and always parallel 
(or antiparallel) to one and the same direction. In this case the wave is said to be linearly 
polarized, or plane polarized. An elliptically polarized wave can clearly be treated as the 
superposition of two plane polarized waves. 


Now let us turn to the definition of the wave vector and introduce the four-dimensional 


wave vector with components 
ki = (s. K). (48.12) 
c 


That these quantities actually form a four-vector is obvious from the fact that we get a 
scalar (the phase of the wave) when we multiply by x’: 


k;x! = ot—k-r. (48.13) 


From the definitions (48.4) and (48.12) we see that the square of the wave four-vector is 
Zero: 


k'k; = 0. (48.14) 
This relation also follows directly from the fact that the expression 
AcAQe n 


must be a solution of the wave equation (46.10). 

As is the case for every plane wave, in a monochromatic wave propagating along the x 
axis only the following components of the energy-momentum tensor are different from zero 
(see § 47): 

T°? = T?! = T = W. 


By means of the wave four-vector, these equations can be written in tensor form as 
ik __ ijk 
T pv: kk. (48.15) 


Finally, by using the law of transformation of the wave four-vector we can easily treat the 
so-called Doppler effect—the change in frequency o of the wave emitted by a source moving 
with respect to the observer, as compared to the "true" frequency o of the same source in 
the reference system (Ko) in which it is at rest. 

Let V be the velocity of the source, i.e. the velocity of the K, system relative to K. Accord- 
ing to the general formula for transformation of four-vectors, we have: 


k9— E k? 


(the velocity of the K system relative to Kọ is —V). Substituting k° =a/c, 
k! =k cos a = oc cos a, where « is the angle (in the K system) between the direction of 


t We assume that the coordinate axes form a right-handed system. 
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emission of the wave and the direction of motion of the source, and expressing c in terms 
of wo, we obtain: 


(48.16) 


e 
les 
^ c^ 
Q-—9s-—V— - 


] — —cos x 
c 


This is the required formula. For V & c, and if the angle x is not too close to 7/2, it gives: 


p 
we My ( + — COs 2 (38.17) 
c 


For x = 7/2, we have: 
F- y? 
O = Da N ees. = wy | l-c (48.18) 


in this case the relative change in frequency is proportional to the square of the ratio Fic. 


PROBLEMS 


1. Determine the dircction and magnitude of the axes of the polarization ellipse in terms of the 
complex amplitude E». 
Solution: The problem consists in determining the vector b :- b, - ibe, whose square is real. We 
have from (48.7): " 
Eo E* = bibi Eo- EX = --2/b, > bo, (1) 
or 
bi bi AP B, biba = AB sin 6, 


where we have introduced the notat-on 


i f Eo: 204 
E; A, ‘Eo: ~ B, Fee ae AS 
B "a B^ Aa‘ 
for the absolute values of Ec, and £o: and for the phase difference 6 between them. Then 
2b, — /A°+ B+ 24B sind + J/ A^ + B —2AB sind, (2) 


from which we get the magnitudes of the semiaxes of the polarization ellipse. 
To determine their directions (relative to the arbitrary initial axes v and z) we start from the 
equality 
Re (Eo b, XE- biri- O, 


which is easily verified by substituting Es = (b; - ibe '*. Writing out this equality in the y, z co- 
ordinates, we get for the angle 0 between the direction of b, and the y axis: 
2AB cos ó 

A?-. p? 


The direction of rotation of the field is determined by the sign of the x component of the vector 
b, «by. Taking its expression from (1) 


* 
2i(by = bz); = Eos Et, — Ef. Eo, > z^ (2) - (zy e 
L Eo, Eo, 


we see that the direction of b, - b; (whether it is along or opposite to the positive direction of the 
x axis), and the sign of the rotation (whether in the same direction, or opposite to the direction of a 
right-hand screw along the x axis) are given by the sign of the imaginary part of the ratio Eo;/ Es, 
(plus for the first case and minus for the second). This is a generalization of the rule (48.11) for the 
case of circular polarization. 


tan 20 .- 


(3) 
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2. Determine the motion of a charge in the field of a plane monochromatic linearly polarized 
wave. 


Solution: Choosing the direction of the field E of the wave as the y axis, we write: 
E, = E = Eo cos wé, Ay =A =~ sin oof 
(€ =¢t—x/c), From formulas (3) and (4) of problem 2, § 47, we find (in the reference system in 


which the particle is at rest on the average) the following representation of the motion in terms of 
the parameter 7 = o£): 


B "Exe eE,c 
X= -gyo Sin 2m, yo? COS y, z=0, 
n e E 2 24,2 a 
t= 578r 3 sin 2, py-mct-x 
272 
[] eEo . 
Ps = — Fg 095 20 Py =—> sn, Ps = 0. 


The charge moves in the x, y plane in a symmetric figure-8 curve with its longitudinal axis along 
the y axis. During a period of the motion, » varies from 0 to 23. 


3. Determine the motion of a charge in the field of a circularly polarized wave. 
Solution: For the field of the wave we have: 


E, = Eo cos ox, E, = Eo sin wé, 


cEo . cE, 
Ay — — —* sin wé, A: = ——2 cos ctc. 
@ @ 
The motion is given by the formulas: 
ec ecEy . 
x=0, y= — — cos of, z= —— + sin of, 
yo yo 
eEo + eEo 
P:=0, Py = —- sin wt, Pz = — — cos ot, 
@ @ 
2fpa 
c?E 
y? -m*c--—.. 
@ 


Thus the charge moves in the y, z plane along a circle of radius ecEy/yw? with a momentum having 
the constant magnitude p = eE,/w; at each instant the direction of the momentum p is opposite to 
the direction of the magnetic field H of the wave. 


§ 49. Spectral resolution 


Every wave can be subjected to the process of spectral resolution, i.e. can be represented as 
a superposition of monochromatic waves with various frequencies. The character of this 
expansion varies according to the character of the time dependence of the field. 

One category consists of those cases where the expansion contains frequencies forming a 
discrete sequence of values. The simplest case of this type arises in the resolution of a purely 
periodic (though not monochromatic) field. This is the usual expansion in Fourier series; 
it contains the frequencies which are integral multiples of the "fundamental" frequency 
Wy = 2n/T, where T is the period of the field. We write it in the form 


f= È fem (49.1) 


(where f is any of the quantities describing the field). The quantities f, are defined in terms 
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of the function f by the integrals 


Tí2 
1 . 
fo f fD "dr. (49.2) 
-T/2 
Because f(t) must be real, 
fen =Se (49.3) 


In more complicated cases, the expansion may contain integral multiples (and sums of 
integral multiples) of several different incommensurable fundamental frequencies. 

When the sum (49.1) is squared and averaged over the time, the products of terms with 
different frequencies give zero because they contain oscillating factors. Only terms of the 
form f, f-n = |f,|? remain. Thus the average of the square of the field, i.e. the average 
intensity of the wave, is the sum of the intensities of its monochromatic components: 


P= Y IME [hl (49.4) 


(where it is assumed that the average of the function f over a period is zero, i.e. fy = f = 0). 

Another category consists of fields which are expandable in a Fourier integral containing 
a continuous distribution of different frequencies. For this to be possible,the function f(t ) 
must satisfy certain definite conditions; usually we consider functions which vanish for 
t — + oo. Such an expansion has the form 


iur. uel 
o= | foe S, (49.5) 
2n 
where the Fourier components are given in terms of the function f(t) by the integrals 
fo= | feat, (49.6) 
Analogously to (49.3), 
dac. (49.7) 


Let us express the total intensity of the wave, i.e. the integral of f? over all time, in terms 
of the intensity of the Fourier components. Using (49.5) and (49.6), we have: 


or, using (49.7), 


o a o ado oc do 
[ rae f vi 5 22 [IP = (49.8) 
=o -o 0 


§ 50. Partially polarized light 


Every monochromatic wave is, by definition, necessarily polarized. However we usually 
have to deal with waves which are only approximately monochromatic, and which contain 
frequencies in a small interval Aw. We consider such a wave, and let w be some average 
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frequency for it. Then its field (to be specific we shall consider the electric field E) at a fixed 

point in space can be written in the form l 
E,(t)e"'", 

where the complex amplitude E,(t) is some slowly varying function of the time (for a strictly 

monochromatic wave Ej would be constant). Since E, determines the polarization of the 

wave, this means that at each point of the wave, its polarization changes with time; such a 

wave is said to be partially polarized. 

The polarization properties of electromagnetic waves, and of light in particular, are 
observed experimentally by passing the light to be investigated through various bodiest 
and then observing the intensity of the transmitted light. From the mathematical point of 
view this means that we draw conclusions concerning the polarization properties of the light 
from the values of certain quadratic functions of its field. Here of course we are considering 
the time averages of such functions. 

Quadratic functions of the field are made up of terms proportional to the products £, Ep, 
EXE; or E,E;. Products of the form 

E Ep = abge 55. . BiEg = Et Bees, 
which contain the rapidly oscillating factors e* 7" give zero when the time average is taken. 
The products E,Ej = Eg, Eo, do not contain such factors, and so their averages are nct 
zero. Thus we see that the polarization properties of the light are completely characterized 
by the tensor 
Jap = Eo, Eos. (50.1) 

Since the vector E, always lies in a plane perpendicular to the direction of the wave, the 
tensor J,, has altogether four components (in this section the indices x, f are understood 
to take on only two values: a, 8 = 1, 2, corresponding to the v and z axes; the x axis is 
along the direction of propagation of the wave). 

The sum of the diagonal elements of the tensor J,, (we denote it by J) is a real quantity— 
the average value of the square modulus of the vector E, (or E): 


J =J,, = E,:E*. (50.2) 
This quantity determines the intensity of the wave, as measured by the energy flux density. 
To eliminate this quantity which is not directly related to the polarization properties, we 
introduce in place of J,, the tensor 


Jap 
Ee 50.3 
Pap J ( ) 
for which paa = 1; we call it the polarization tensor. 


From the definition (50.1) we see that the components of the tensor J,,, and consequently 
also pap, are related by l 

Pap = Pha (50.4) 

(i.e. the tensor is hermitian). Consequently the diagonal components p,, and p3, are real 

(with p,,+ 22 = 1) while p,, = p72. Thus the polarization is characterized by three real 


parameters. 
Let us study the conditions that the tensor p,, must satisfy for completely polarized light. 
In this case Ej = const, and so we have simply 


Jag = J Pop = Es, Eos (50.5) 


* For example, through a Nicol prism. 
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(without averaging). i.e. the components of the tensor can be written as products of com- 
ponents of some constant vector. The necessary and sufficient condition for this is that the 
determinant vanish: 


Pan! = Pir P22 Pi P2 = 9. (50.6) 
The opposite case is that of unpolarized or natural light. Complete absence of polarization 
means that all directions (in the v, plane) are equivalent. In other words the polarization 
tensor must have the form: 
Pap = May (50.7) 
The determinant is [Pas] = 1. 
In the general case of arbitrary polarization the determinant has values between Q and 1. 
By the degree of polarization we mean the positive quantity P, defined from 
(Panl = dE — P^). (50.8) 
It runs from the value 0 for unpolarized to | for polarized light. 
An arbitrary tensor p,5 can be split into two parts—a symmetric and an antisymmetric 
part. Of these, the first 
S, = Pap py) 
is real because of the hermiticity of p,,. The antisymmetric part is pure imaginary. Like any 
antisymmetric tensor of rank equal to the number of dimensions, it reduces to a pseudo- 
scalar (sec the footnote on p. 17): 
HR = -i eA, 


where 4 is a real pseudoscalar. e,, 1s the unit antisymmetric tensor (with components 
€,; = —0;, = I). Thus the polarization tensor has the form: 
i 
Pap = Sap z 04. Sap = Spas (50.9) 


i.e. it reduces to one real symmetric tensor and one pseudoscalar. 
For a circularly polarized wave, the vector E, = const, where 
Eo; = tiEq,. 

It is easy to see that then S,,=36,,, while A = +1. On the other hand, for a linearly 
polarized wave the constant vector E, can be chosen to be real, so that A = 0. In the general 
case the quantity 4 may be called the degree of circular polarization; it runs through values 
from +1 to — 1, where the limiting values: correspond to right- and left-circularly polarized 
waves, respectively. 

The real symmetric tensor S,,. like any symmetric tensor. can be brought to principal 
axes, with different principal values which we denote by 4, and 4,. The directions of the 
principal axes are mutually perpendicular. Denoting the unit vectors along these directions 
by a!) and n'?), we can write S,, in the form: 


Sap = A ni ny AM nye, 2,4+4, = 1. (50.10) 


The quantities 4, and 2; are positive and take on values from 0 to 1. 


+ The fact that the determinant is positive for any tensor of the form (50.1) is easily seen by considering 
the averaging. for simplicity. as a summation over discrete values, and using the well-known algebraic 
inequality 

{Dox de D lel? E pal’, 
ub c b 
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Suppose that A = 0, so that py, = Sap. Each of the two terms in (50.10) has the form of 


a product of two components of a constant vector (V2,;n™ or 4/2,n'?)). In other words, 
each of the terms corresponds to linearly polarized light. Furthermore, we see that there 
is no term in (50.10) containing products of components of the two waves. This means that 
the two parts can be regarded as physically independent of one another, or, as one says, they 
are incoherent. Yn fact, if two waves are independent, the average value of the product 
EE?) is equal to the product of the averages of each of the factors, and since each of them 
is Zero, 
EWE?) = 0. 


Thus we arrive at the result that in this case (A = 0) the partially polarized light can be 
represented as a superposition of two incoherent waves (with intensities proportional to 
2, and 23), linearly polarized along mutually perpendicular directions.T (In the general case 
of a complex tensor p,, one can show that the light can be represented as a superposition of 
two incoherent elliptically polarized waves, whose polarization ellipses are similar and 
mutually perpendicular (see problem 2).) 

Let $ be the angle between the axis 1 (the y axis) and the unit vector n(!?; then 


n()-(cosó,sinó),  n® = (—sin $,cos $). 


Introducing the quantity / = 4, — 4; (assume 4, >å), we write the components of the 
tensor (50.10) in the following form: 


ze I sin 26 ) 
#7 2X Isin2d 1—-1cos26/ 


Thus, for an arbitrary choice of the axes y and z, the polarization properties of the wave can 
be characterized by the following three real parameters: A—the degree of circular polariza- 
tion, /—the degree of maximum linear polarization, and $—the angle between the direction 
n(! of maximum polarization and the y axis. 

In place of these parameters one can choose another set of three parameters: 


17 sin 26, č, = A, E, = l cos 26 (50.12) 
(the Stokes parameters). The polarization tensor is expressed in terms of them as 


d ( 1+č3 E 
Pas ode. 1-8 
All three parameters run through values from —1 to +1. The parameter č, characterizes 
the linear polarization along the y and z axes: the value čą = I corresponds to complete 
linear polarization along the y axis, and č, = — 1 to complete polarization along the =z axis. 
The parameter £, characterizes the linear polarization along directions making an angle of 
45° with the y axis: the value č, = 1 means complete polarization at an angle $ = 7/4. 
while č = —1 means complete polarization at ¢ = —z/4.1 


(50.11) 


(50.13) 


t The determinant [Sas] = 4:42; suppose that 4, > 22; then the degree of polarization, as defined in 
(50.8), is P = 1 —24,. In the present case (4 = 0) one frequently characterizes the degree of polarization by 
using the depolarization coefficient, defined as the ratio 45/4. 

i For a completely elliptically polarized wave with axes of the ellipse b; and bz (see § 48), the Stokes 
parameters are: 

[4 EE Wy €2 = 2b; bz, £3 = b? —b3. 
Here the y axis is along bi, while the two signs in £; correspond to directions of bz along and opposite to the 
direction of the z axis. 
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The determinant of (50.13) is equal to 
les;| = 4(1 - č} - 83-3). (50.14) 
Comparing with (50.8), we see that 


P = VE +E +A. (50.15) 


Thus, for a given overall degree of polarization P, different types of polarization are 
possible, characterized by the values of the three quantities €,, č2, €3, the sum of whose 
squares is fixed; they form a sort of vector of fixed length. 


We note that the quantities č, = A and /é?+&2 = | are invariant under Lorentz trans- 
formations. This remark is already almost obvious from the very meaning of these quantities 
as degrees of circular and linear polarization. t 


PROBLEMS 


1. Resolve an arbitrary partially polarized light wave into its "natural" and "polarized" parts. 
Solution: This resolution means the representation of the tensor Jz, in the form 
Jag =OS t ERE”. 
The first term corresponds to the natural, and the second to the polarized parts of the light. To 
determine the intensities of the parts we note that the determinant 
[Jas — 49 00,5 = |EPER*| =0. 
Writing Jas =Jpag in the form (50.13) and solving the equation, we get 
Jo =J(1—P). 
The intensity of the polarized part is J® = |E®|2 =J—J™ = JP, 

The polarized part of the light is in general an elliptically polarized wave, where the directions 
of the axes of the ellipse coincide with the principal axes of the tensor Sys. The lengths 5, and b; of 
the axes of the ellipse and the angle ¢ formed by the axis b, and the y axis are given 
by the equations: 

b?+bł=JP, 2b,b,=JPé,, tan2¢ -$ 
3 

2. Represent an arbitrary partially polarized wave as a superposition of two incoherent ellip- 
tically polarized waves. 

Solution: For the hermitian tensor pas the “principal axes” are determined by two unit complex 
vectors n (n-n* = 1), satisfying the equations 

pas Mg = Ang. (1) 
The principal values 4, and 45 are the roots of the equation 
lpas — 4óa5] = 0. 
Multiplying (1) on both sides by n*, we have: 


1 ——— 
A= pag Ng Ng = ET 


from which we see that 21, 42 are real and positive. Multiplying the equations 


1 
Pag n — Ay n», Pis noe = Ag n2» 


t For a direct proof, we note that since the field of the wave is transverse in any reference frame, it is clear 
from the start that the tensor pas remains two-dimensional in any new frame. The transformation of pas into 
Pas leaves unchanged the sum of absolute squares pas p*, (in fact, the form of the transformation does not 
depend on the specific polarization properties of the light, while for a completely polarized wave this sum 
is 1 in any reference system). Because this transformation is real, the real and imaginary parts of the tensor 
Pas (50.9) transform independently, so that the sums of the squares of the components of each separately 
remain constant, and are expressed in terms of / and A. 
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for the first by n?" and for the second by n/", taking the difference of the results and using the 
hermiticity of p,,, we get: 
(Ay — Ani? = 0. 
It then follows that n?? - n2* = 0, i.e. the unit vectors n? and nt?” are mutually orthogonal. 
The expansion of the wave is provided by the formula 
pag = Ay rn P -- AS nOn, 

One can always choose the complex amplitude so that, of the two mutually perpendicular com- 
ponents, one is real and the other imaginary (compare § 48). Setting 

n? = by, nD = ib; 
(where now b, and bz are understood to be normalized by the condition 52-- b2— 1), we get from 
the equation n™ : n(2* = 0: 

n? = iba, n2 = by. 


We then see that the ellipses of the two elliptically polarized vibrations are similar (have equal axis 
ratio), and one of them is turned through 90° relative to the other. 


3. Find the law of transformation of the Stokes parameters for a rotation of the y, z axes through 
an angle ¢. 

Solution: The law is determined by the connection of the Stokes parameters to the components of 
the two-dimensional tensor in the yz plane, and is given by the formulas 


€, = 6&1 cos 26 — 65 sin 2¢, č; = č, sin 2¢+ £5 cos 2¢, ča = 6s. 


§ 51. The Fourier resolution of the electrostatic field 


The field produced by charges can also be formally expanded in plane waves (in a Fourier 
integral). This expansion, however, is essentially different from the expansion of electro- 
magnetic waves in vacuum, for the field produced by charges does not satisfy the homo- 
geneous wave equation, and therefore each term of this expansion does not satisfy the 
equation. From this it follows that for the plane waves into which the field of charges can be 
expanded, the relation k^ = w?/c?, which holds for plane monochromatic electromagnetic 
waves, is not fulfilled. 

In particular, if we formally represent the electrostatic field as a superposition of plane 
waves, then the "frequency" of these waves is clearly zero, since the field under consideration 
does not depend on the time. The wave vectors themselves arc, of course, different from zero. 

We consider the field produced by a point charge e, located at the origin of coordinates. 
The potential $ of this field is determined by the equation (see $ 36) 


Ag = —4ned(r). (51.1) 
We expand ó in a Fourier integral, i.e., we represent it in the form 
+a 
i d*k 
= iker 51.2 
om [tn (51.2) 


where d?k denotes dk, dk, dk,. In this formula ¢, = f d(r)e "* "dV. Applying the Laplace 
operator to both sides of (51.2), we obtain 


+o 


Ag =- i ketto, 


-œ 


dk 
Qn)" 
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so that the Fourier component of the expression A is 
(Ad) = -K^&. 
On the other hand, we can find (A@), by taking Fourier components of both sides of 
equation (51.1), 


(Ady=— [ 4xed(rye"* dV = —4ne. 
Equating the two expressions obtained for (A@),, we find 
4ne 
dy = ur (51.3) 
This formula solves our problem. 
Just as for the potential ġ, we can expand the field 
+a 
Qu Kk 
E= E,e*'' ——. 51.4 
| 5s oe 
With the aid of (51.2), we have 
to 
ac, aK ac, Kk 
E= — grad { $,e* Qi - f ik bye Qm* 
Comparing with (51.4), we obtain 
. . Anek 
E, = — ikó, = l Uu (51.5) 


From this we see that the field of the waves, into which we have resolved the Coulomb field, 
is directed along the wave vector. Therefore these waves can be said to be /ongitudinal. 


§ 52. Characteristic vibrations of the field 


We consider an electromagnetic field (in the absence of charges) in some finite volume of 
space. To simplify further calculations we assume that this volume has the form of a rect- 
angular parallelepiped with sides A, B. C, respectively. Then we can expand all quantities 
characterizing the field in this parallelepiped in a triple Fourier series (for the three co- 
ordinates). This expansion can be written (e.g. for the vector potential) in the form: 


A=) Ae (52.1) 
k 


explicitly indicating that A is real. The summation extends here over all possible values of 
the vector k whose components run through the values 


2an, 2nn 2nn. 

=, = ——, k. = — =, §2.2 

* A , B C (52.2) 
where n,. #,, n, are positive or negative integers. Since A is real, the coefficients in the expan- 
sion (52.1) are related by the equations A_, = Af. From the equation div A = 0 it follows 
that for each k, 


k k 


k-A, = 0. (52.3) 


i.e.. the complex vectors A, are "perpendicular" to the corresponding wave vectors k. The 
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vectors A, are, of course, functions of the time; from the wave equation (46.7), they satisfy 
the equation . 
ALT CK A, = 0. (52.4) 

If the dimensions A, B, C of the volume are sufficiently large, then neighboring values 
of kx, ky, k, (for which ny, n, n, differ by unity) are very close to one another. In this case 
we may speak of the number of possible values of k,, ky, k, in the small intervals Ak,, Ak,, 
Ak.. 

Since to neighboring values of, say, kx, there correspond values of n, differing by unity, 
the number An, of possible values of k, in the interval Ak, is equal simply to the number of 
values of n, in the corresponding interval. Thus, we obtain 


A B C 
An, = — Ak, | An,—- — Ak, An, =— AK, 
ip" inp" "= n 
The total number An of possible values of the vector k with components in the intervals 
Ak,, Ak,, Ak, is equal to the product An, An, An,, that is, 


An= a Gn Ak, Ak, Ak,, (52.5) 
where V = ABC is the volume of the field. It is easy to determine from this the number of 
possible values of the wave vector having absolute values in the interval Ak, and directed 
into the element of solid angle Ao. To get this we need only transform to polar coordinates 
in the "& space" and write in place of Ak „Ak Ak. the element of volume in these coordinates. 
Thus 

2 
An — Qn asy k* Ak ^o. (52.6) 
Replacing Ao by 4z, we find the number of possible values of k with absolute value in 
the interval AA and pointing in all directions: An = (V/2n?)k? Ak. 
We calculate the total energy 


l 
é= z [E+ 
8n 


of the field, expressing it in terms of the quantities A,. For the electric and magnetic fields 


we have i i 
E = —-AÀ = —- Y Aet, 
c z2 k 


H = curl A =i) (k x Ae". 20 (82.7) 
k 


When calculating the squares of these sums, we must keep in mind that all products of 
terms with wave vectors k and k’ such that kk’ give zero on integration over the whole 
volume. In fact, such terms contain factors of the form e***?*, and the integral, e.g. of 


i a nxx 
f e dx, 
(0) 
with integer n, different from zero, gives zero. In those terms with k' = —k, the exponentials 
drop out and integration over dV gives just the volume V. 
As a result, we obtain 
4 


4-6) io^ AE + (k x Ay): (x AD]. 
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From (52.3). we have ; 
(kx AL) (Kx AL) = K Ap Az. 
so that 


b 


V D. 
-— 2. (Ap AË HARTA ALL (52.8) 
Each term of this sum corresponds to one of the terms of the expansion (52.1). 

Because of (52.4). the vectors A, are harmonic functions of the time with frequencies 
w, = ck. depending only on the absolute value of the wave vector. Depending on the 
choice of these functions, the terms in the expansion (52.1) can represent standing or running 
plane waves. We shall write the expansion so that its terms describe running waves. To do 
this we write it in the form 

= Y (aye! + ate!) (52.9) 
k 
which explicitly exhibits that A is real, and each of the vectors a, depends on the time 
according to the law 
a,-e "I". ay ck. (52.10) 
Then each individual term in the sum (52.9) will be a function only of the difference 
k:r —cj,f. which corresponds to a wave propagating in the k direction. 

Comparing the expansions (52.9) and (52.1), we find that their coefficients are related 

by the formulas 
A, = a,+a*,. 


and from (52.10) the time derivatives are related by 
A, = —ick(a,—a*,). 


Substituting in (52.8), we express the field energy in terms of the coefficients of the expansion 

(52.9). Terms with products of the form a,-a_, or az-a* , cancel one another; also noting 

that the sums Y'a, aj and Ya. a. ; differ only in the labeling of the summation index. 

and therefore coincide. we finally obtain: 

" k?y 

E=] Ene 6-——aa. (52.11) 
k 2n 

Thus the total energy of the field is expressed as a sum of the energies &,, associated with 

each of the plane waves individually. 


In a completely analogous fashion, we can calculate the total momentum of the field, 


for which we obtain 


Ok (52.12) 


This result could have been anticipated in view of the relation between the energy and 
momentum of a plane wave (see § 47). 

The expansion (52.9) succeeds in expressing the field in terms of a series of discrete 
parameters (the vectors a,). in place of the description in terms of a continuous series of 
parameters, which is essentially what is done when we give the potential A(x, v, z, t) at all 
points of space. We now make a transformation of the variables a,, which has the result 
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that the equations of the field take on a form similar to the canonical equations (Hamilton 
equations) of mechanics. 
We introduce the real “canonical variables" Q, and P, according to the relations 


V 
Q, = inet (a, - a4), (52.13) 


V 
P, = - io, AJ ane (a, — ap) = Q,. 


The Hamiltonian of the field is obtained by substituting these expressions in the energy 
(52.11): e 
X =Y Hy = YAP + 0101). (52.14) 
k k 


Then the Hamilton equations 6/éP, = Q, coincide with P, = Q,, which is thts a con- 
sequence of the equations of motion. (This was achieved by an appropriate choice of the 
coefficient in (52.13).) The equations of motion, 63€ /6Q, = —P,. become the equations 


Q, - 0;Q, = 0, (52.15) 
that is, they are identical with the equations of the field. 

Each of the vectors Q, and P, is perpendicular to the wave vector k, i.e. has two in- 
dependent components. The direction of these vectors determines the direction of polariza- 
tion of the corresponding travelling wave. Denoting the two components of the vector Q, 
(in the plane perpendicular to k) by Q,;, j = 1, 2, we have 
and similarly for P,. Then à 

X =) X, Hy; = AP; 0107). (52.16) 
kj 


We see that the Hamiltonian splits into a sum of independent terms Xj, each of which 
contains only one pair of the quantities Q,;, P,;. Each such term corresponds to a traveling 
wave with a definite wave vector and polarization. The quantity J£,; has the form of the 
Hamiltonian of a one-dimensional “oscillator”, performing a simple harmonic vibration. 
For this reason, one sometimes refers to this result as the expansion of the field in terms of 
oscillators. 

We give the formulas which express the field explicitly in terms of the variables Py, Q,- 
From (52.13), we have 


i [n i fn , 
a, = k Rie (P, — iwQ), ap = — k E (P, +i). (52.17) 


Substituting these expressions in (52.1), we obtain for the vector potential of the field: 


CAE | 
A - 2 | Y E ekQ cos er- Pu sin kc) (52.18) 
k 
For the electric and magnetic fields, we find 
E--2 a> (ckQ, sink-r+P, cos k'r), 
k 


ad (52.19) 
H = —2 Y. Leo QU sin k-r+(kx Pj cos k'r}. 
k 


CHAPTER 7 


THE PROPAGATION OF LIGHT 


§ 53. Geometrical optics 


A plane wave is characterized by the property that its direction of propagation and 
amplitude are the same everywhere. Arbitrary electromagnetic waves, of course, do not 
have this property. Nevertheless, a great many electromagnetic waves, which are not plane, 
have the property that within each small region of space they can be considered to be plane. 
For this, it is clearly necessary that the amplitude and direction of the wave remain practically 
constant over distances of the order of the wavelength. If this condition is satisfied, we can 
introduce the so-called wave surface, i.e. a surface at all of whose points the phase of the 
wave is the same (at a given time). (The wave surfaces of a plane wave are obviously 
planes perpendicular to the direction of propagation of the wave.) In each small region of 
space we can speak of a direction of propagation of the wave, normal to the wave surface. 
In this way we can introduce the concept of rays—curves whose tangents at each point 
coincide with the direction of propagation of the wave. 

The study of the laws of propagation of waves in this case constitutes the domain of 
geometrical optics. Consequently, geometrical optics considers the propagation of waves, 
in particular of light, as the propagation of rays, completely divorced from their wave 
properties. In other words, geometrical optics corresponds to the limiting case of small 
wavelength, À — 0. 

We now take up the derivation of the fundamental equation of geometrical optics—the 
equation determining the direction of the rays. Let f be any quantity describing the field 
of the wave (any component of E or H). For a plane monochromatic wave, f has the form 


f= ag r7 orte) = ae ~kix! +a) (53.1) 


(we omit the Re; it is understood that we take the real part of all expressions). 
We write the expression for the field in the form 


f 5 ae", (53.2) 


In case the wave is not plane, but geometrical optics is applicable, the amplitude a is, 
generally speaking, a function of the coordinates and time, and the phase y, which is called 
the eikonal, does not have a simple form, as in (53.1). It is essential, however, that y be a 
large quantity. This is clear immediately from the fact that it changes by 2x when we move 
through one wavelength, and geometrical optics corresponds to the limit 4 — 0. 
Over small space regions and time intervals the eikonal i can be expanded in series; to 
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terms of first order, we have 


oy |, OV 
y= Vor atia 


(the origin for coordinates and time has been chosen within the space region and time 
interval under consideration; the derivatives are evaluated at the origin). Comparing this 
expression with (53 1), we can write 


Oy — 
k= or grad y, w= Tp (53.3) 


which corresponds to the fact that in each small region of space (and each small interval of 
time) the wave can be considered as plane. In four-dimensional form, the relation (53.3) is 
expressed as 


k= - (53.4) 


where k; is the wave four-vector. 


We saw in § 48 that the components of the four-vector K' are related by k,k' = 0. Sub- 
stituting (53.4), we obtain the equation 


= = = 0. (53.5) 


This equation, the eikonal equation, is the fundamental equation of geometrical optics. 
The eikonal equation can also be derived by direct transition to the limit A — 0 in the 
wave equation. The field f satisfies the wave equation 


KA 
ax,0x' — 
Substituting f = ae”, we obtain 
0*a da oy aw oy ow 
e" el 
Inar 17 ay ay T acad ax ax =? (53.6) 


But the eikonal y, as we pointed out above, is a large quantity; therefore we can neglect 
the first three terms compared with the fourth, and we arrive once more at equation (53.5). 

We shall give certain relations which, in their application to the propagation of light in 
vacuum, lead only to completely obvious results. Nevertheless, they are important because, 
in their general form, these derivations apply also to the propagation of light in material 
media. 

From the form of the eikonal equation there results a remarkable analogy between 
geometrical optics and the mechanics of material particles. The motion of a material 
particle is determined by the Hamilton-Jacobi equation (16.11). This equation, like the 
eikonal equation, is an equation in the first partial derivatives and is of second degree. As 
we know, the action S is related to the momentum p and the Hamiltonian 2f of the particle 
by the relations 

. 08 _ ôS 
PT Ot 


Comparing these formulas with the formulas (53.3), we see that the wave vector plays the 
same role in geometrical optics as the momentum of the particle in mechanics, while the 
frequency plays the role of the Hamiltonian, i.e., the energy of the particle. The absolute 
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magnitude k of the wave vector is related to the frequency by the formula k = w/c. This 
relation ts analogous to the relation p = &/c between the momentum and energy of a particle 
with zero mass and velocity equal to the velocity of light. 

For a particle, we have the Hamilton equations 


CH . OF 
p= "a; =T1= 
cr cp 
In view of the analogy we have pointed out, we can immediately write the corresponding 
equations for rays: 
k=--@ is oo 
ek 
In vacuum, o = ck, so that k = 0, v = cn (n is a unit vector along the direction of propaga- 
tion); in other words, as it must be, in vacuum the rays are straight lines, along which the 
light travels with velocity c. 

The analogy between the wave vector of a wave and the momentum of a particle is made 
especially clear by the following consideration. Let us consider a wave which is a super- 
position of monochromatic waves with frequencies in a certain small interval and occupying 
some finite region in space (this is called a wave packet). We calculate the four-momentum of 
the field of this wave, using formula (32.6) with the energy-momentum tensor (48.15) (for 
each monochromatic component). Replacing k' in this formula by some average value, 
we obtain an expression of the form 


(53.7) 


a? 
or 


P! = Aki, (53.8) 
where the coefficient of proportionality A between the two four-vectors P! and K! is some 
scalar. In three-dimensional form this relation gives: 

P = Ak, ó = Aw. (53.9) 
Thus we see that the momentum and energy of a wave packet transform, when we go from 
one reference system to another, like the wave vector and the frequency 
Pursuing the analogy, we can establish for geometrical optics a principle analogous to the 
principle of least action in mechanics. However, it cannot be written in Hamiltonian form as 
ôf Ldt = 0, since it turns out to be impossible to introduce, for rays, a function analogous 
to the Lagrangian of a particle. Since the Lagrangian of a particle is related to the Hamil- 
tonian # by the equation L — p-:CJ€/0p—J£, replacing the Hamiltonian # by the 
frequency œw and the momentum by the wave vector k, we should have to write for the 
Lagrangian in optics k -€w/€k—q. But this expression is equal to zero, since w = ck. The 
impossibility of introducing a Lagrangian for rays is also clear directly from the considera- 
tion mentioned earlier that the propagation of rays is analogous to the motion of particles 
with zero mass. 
If the wave has a definite constant frequency w, then the time dependence of its field is 
given by a factor of the form e "*'. Therefore for the eikonal of such a wave we can write 


= —wt+w w(x, y, Z), (53.10) 


where Wo is a function only of the coordinates. The eikonal equation (53.5) now takes the 
form 


(grad Yo)? = 


2 
w 
T (53.11) 


The wave surfaces are the surfaces of constant eikonal, i.e. the family of surfaces of the form 


§ 54 INTENSITY 133 


Wo(x, y, z) = const. The rays themselves are at each point normal to the corresponding 
wave surface; their direction is determined by the gradient Vij. 

As is well known, in the case where the energy is constant, the principle of least action 
for particles can also be written in the form of the so-called principle of Maupertuis: 


ôS =ô | p-dl=0, 


where the integration extends over the trajectory of the particle between two of its points. 
In this expression the momentum is assumed to be a function of the energy and the co- 
ordinates. The analogous principle for rays is called Fermat's principle. In this case, we 
can write by analogy: 


by = 5 [ k-di=0. (53.12) 
In vacuum, k = (@/c)n, and we obtain (d]-n = dl): 
à f dl =0, (53.13) 


which corresponds to rectilinear propagation of the rays. 


§ 54. Intensity 


In geometrical optics, the light wave can be considered as a bundle of rays. The rays 
themselves, however, determine only the direction of propagation of the light at each point; 
there remains the question of the distribution of the light intensity in space. 

On some wave surface of the bundle of rays under consideration, we isolate an in- 
finitesimal surface element. From differential geometry it is known that every surface has. 
at each of its points, two (generally different) principal radii of curvature. Let ac and bd 
(Fig. 7) be elements of the principal circles of curvature, constructed at a given element of 


the wave surface. Then the rays passing through a and c meet at the corresponding center of 
curvature O,, while the rays passing through b and d meet at the other center of curvature 
O2. 

For fixed angular openings of the beams starting from O, and O,, the lengths of the arcs 
ac and bd are, clearly, proportional to the corresponding radii of curvature R, and R, (i.e. 
to the lengths O, O and O, O). The area of the surface element is proportional to the product 
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of the lengths ac and bd, i.e., proportional to R, R3. In other words, if we consider the 
element of the wave surface bounded by a definite set of rays, then as we move along them 
the area of the element will change proportionally to R, R3. 

On the other hand, the intensity, i.e. the energy flux density, is inversely proportional to 
the surface area through which a given amount of light energy passes. Thus we arrive at the 
result that the Intensity is 

const 
~ R R; 

This formula must be understood as follows. On each ray (AB in Fig. 7) there are definite 
points O, and O;, which are the centers of curvature of all the wave surfaces intersecting 
the given ray. The distances OO, and OO, from the point O where the wave surface inter- 
sects the ray, to the points O, and O,, are the radii of curvature R, and R, of the wave 
surface at the point O. Thus formula (54.1) determines the change in intensity of the light 
along a given ray as a function of the distances from definite points on this ray. We emphasize 
that this formula cannot be used to compare intensities at different points on a single wave 
surface. 

Since the intensity is determined by the square modulus of the field, we can write for the 
change of the field itself along the ray 


(54.1) 


Lo eM, (54.2) 


where in the phase factor e"^ we can write either e^: cr e“®?. The quantities e'**' and 
e'*^: (for a given ray) differ from each other only by a constant factor, since the difference 
R,— R;, the distance between the two centers of curvature, is a constant. 

If the two radii of curvature of the wave surface coincide, then (54.1) and (54.2) have the 
form: 


I const const 


ue SEU. ele (54.3) 
This happens always when the light is emitted from à point source (the wave surfaces are 
then concentric spheres and R is the distance from the light source). 

From (54.1) we see that the intensity becomes infinite at the points R, — 0, R, —0, i.e. 
at the centers of curvature of the wave surface. Applving this to all the rays in a bundle, we 
find that the intensity of the light in the given bundle becomes infinite, generally, on two 
surfaces—the geometrical loci of all the centers of curvature of the wave surfaces. These 
surfaces are called caustics. In the special case of a beam of ravs with spherical wave surfaces, 
the two caustics fuse into a single point ( focus). 

We note from well-known results of diflerential geometry concerning the properties of the 
loci of centers of curvature of a family of surfaces, that the rays are tangent to the caustic. 

It is necessary to keep in mind that (for convex wave surfaces) the centers of curvature of 
the wave surfaces can turn out to lie not on the rays themselves, but on their extensions 
beyond the optical system from which they emerge. In such cases we speak of imaginary 
caustics (or foci). In this case the intensity of the light does not become infinite anywhere. 

As for the increase of intensity to infinity, in actuality we must understand that the 
intensity does become large at points on the caustic, but it remains finite (see the problem 
in § 59). The formal increase to infinity means that the approximation of geometrical optics 
is never applicable in the neighborhood of the caustic. To this is related the fact that the 
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change in phase along the ray can be determined from formula (54.2) only over sections of 
the ray which do not include its point of tangency to the caustic. Later (in § 59), we shall 
show that actually in passing through the caustic the phase of the field decreases by 2/2. 
This means that if, on the section of the ray before its first intersection with the caustic, the 
field is proportional to the factor e'* (x is the coordinate along the ray), then after passage 
through the caustic the field will be proportional to e'** 7/2», The same thing occurs in the 
neighborhood of the point of tangency to the second caustic, and beyond that point the field 
is proportional to e'C*- t 


§ 55. The angular eikonal 


A light ray traveling in vacuum and impinging on a transparent body will, on its emergence 
from this body, generally have a direction different from its initial direction. This change in 
direction will, of course, depend on the specific properties of the body and on its form. 
However, it turns out that one can derive general laws relating to the change in direction of a 
light ray on passage through an arbitrary material body. In this it is assumed only that 
geometrical optics is applicable to rays propagating in the interior of the body under con- 
sideration. As is customary, we shall call such transparent bodies, through which rays of 
light propagate, optical systems. 

Because of the analogy mentioned in § 53, between the propagation of rays and the motion 
of particles, the same general laws are valid for the change in direction of motion of a particle, 
initially moving in a straight line in vacuum, then passing through some electromagnetic 
field, and once more emerging into vacuum. For definiteness, we shall, however, always 
speak later of the propagation of light rays. 

We saw in a previous section that the eikonal equation, describing the propagation of the 
rays, can be written in the form (53.11) (for light of a definite frequency). From now on we 
shall, for convenience, designate by y the eikonal yo divided by the constant w/c. Then the 
basic equation of geometrical optics has the form: 


(Vy)? = 1. (55.1) 


Each solution of this equation describes a definite beam of rays, in which the direction 
of the rays passing through a given point in space is determined by the gradient of y at that 
point. However, for our purposes this description is insufficient, since we are seeking general 
relations determining the passage through an optical system not of a single definite bundle of 
rays, but of arbitrary rays. Therefore we must use an eikonal expressed in such a form 
that it describes all the generally possible rays of light, i.e. rays passing through any pair of 
points in space. In its usual form the eikonal v/(r) is the phase of the rays in a certain bundle 
passing through the point r. Now we must introduce the eikonal as a function y(r, r^) of the 
coordinates of two points (r, r' are the radius vectors of the initial and end points of the ray). 
A ray can pass through each pair of points r, r', and W(r, r^) is the phase difference (or, as it 
is called, the optical path length) of this ray between the points r and r'. From now on we 
shall always understand by r and r' the radius vectors to points on the ray before and after 
its passage through the optical system. 


t Although formula (54.2) itself is not valid near the caustic, the change in phase of the field corresponds 
formally to a change in sign (i.e. multiplication by e") of R, or R3 in this formula. 
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If in Wr, r’) one of the radius vectors, say r’, is fixed, then y as a function of r describes 

a definite bundle of rays, namely, the bundle of rays passing through the point r’. Then y 

must satisfy equation (55.1), where the differentiations are applied to the components of r. 
Similarly, if r is assumed fixed, we again obtain an equation for yr, r’), so that 

(Vy =l, (Vay =l. (55.2) 

The direction of the ray is determined by the gradient of its phase. Since y(r, r^) is the 

difference in phase at the points r and r', the direction of the ray at the point r’ is given by 

the vector n’ = 0w/dr’, and at the point r by the vector n = — oy/ór. From (55.2) it is clear 

that n and n’ are unit vectors: 
n? — n^? =]. (55.3) 
The four vectors r, r', n, n' are interrelated, since two of them (n, n’) are derivatives of a 


certain function y with respect to the other two (r, r’). The function y itself satisfies the 
auxiliary conditions (55.2). 

To obtain the relation between n, n', r, r', it is convenient to introduce, in place of y, 
another quantity, on which no auxiliary condition is imposed (i.e., is not required to satisfy 
any differential equations). This can be done as follows. In the function y the independent 
variables are r and r', so that for the differential dy we have 


dj = zp dr+ sede’ = -ndren dr. 


We now make a Legendre transformation from r, r’ to the new independent variables n, 
n', that is, we write 
di = —d(n:r) t r- dnt d(n':r)—r': dn’, 
from which, introducing the function 
x= r nry, (55.4) 
we have 
dy = —r-dn+r’-dn’. (55.5) 
The function x is called the angular eikonal; as we see from (55.5), the independent 
variables in it are n and n'. No auxiliary conditions are imposed on y. In fact, equation (55.3) 
now states only a condition referring to the independent variables: of the three components 
Ny, Ny, nz, Of the vector n (and similarly for n^), only two are independent. As independent 
variables we shall use n, n, n5, n;; then 
n, = V1—- n1 n, ng = V1- n? — nj. 
Substituting these expressions in 
dy = —xdn,—ydn,—z dn, * x' dn, 4 y' dny z' dni, 
we obtain for the differential dy: 


n v ee 
dy = — (»- 2 x) dn,— (2-% x} dn,+ (v- a x’) a+ (2 - Zx ) dn;. 
ny nx n, n 


From this we obtain, finally, the following equations: 


ð ð 
"CN 3 
n, On, n, on, (55.6) 
jowl. NES ; 
tips OE i PN) 
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which is the relation sought between n, n', r, r’. The function y characterizes the special 
properties of the body through which the rays pass (or the properties of the field, in the case 
of the motion of a charged particle). 

For fixed values of n, n', each of the two pairs of equations (55.6) represent a straight line. 
These lines are precisely the rays before and after passage through the optical system. Thus 
the equation (55.6) directly determines the path of the ray on the two sides of the optical 
system. 


$ 56. Narrow bundles of rays 


In studying the passage of beams of rays through optical systems, special interest attaches 
to bundles whose rays all pass through one point (such bundles are said to be homocentric). 

After passage through an optical system, homocentric bundles in general cease to be 
homocentric, i.e. after passing through a body the rays no longer come together in any one 
point. Only in exceptional cases will the rays starting from a luminous point come together 
after passage through an optical system and all meet at one point (the image of the luminous 
point).t 

One can show (see § 57) that the only case for which all homocentric bundles remain 
strictly homocentric after passage through the optical system is the case of identical imaging, 
i.e., the case where the image differs from the object only in its position or orientation, or is 
mirror inverted. 

Thus no optical system can give a completely sharp 1mage of an object (having finite 
dimensions) except in the trivial case of identical imaging. Only approximate, but not 
completely sharp images can be produced of an extended body, in any case other than for 
identical imaging. 

The most important case where there is approximate transition of homocentric bundles 
into homocentric bundles is that of sufficiently narrow beams (i.e. beams with a small 
opening angle) passing close to a particular line (for a given optical system). This line is 
called the optic axis of the system. 

Nevertheless, we must note that even infinitely narrow bundles of rays (in the three- 
dimensional case) are in general not homocentric; we have seen (Fig. 7) that even in such a 
bundle different rays intersect at different points (this phenomenon is called astigmatism). 
Exceptions are those points of the wave surface at which the two principal radii of curvature 
are equal—-a small region of the surface in the neighborhood of such points can be considered 
as spherical, and the corresponding narrow bundle of rays is homocentric. 

We consider an optical system having axial symmetry.§ The axis of symmetry of the 
system is also its optical axis. The wave surface of a bundle of rays traveling along this axis 
also has axial symmetry; as we know, surfaces of rotation have equal radii of curvature at 
their points of intersection with the symmetry axis. Therefore a narrow bundle moving in this 
direction remains homocentric. 


t The point of intersection can lie either on the rays themselves or on their continuations; depending on 
this, the image is said to be real or virtual. 

t Such imaging can be produced with a plane mirror. 

$ It can be shown that the problem of image formation with the aid of narrow bundles, moving in the 
neighborhood of the optical axis in a nonaxially-symmetric system, can be reduced to image formation in 
an axially-symmetric system plus a subsequent rotation of the image thus obtained, relative to the object. 
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To obtain general quantitative relations, determining image formation with the aid of 
narrow bundles, passing through an axially-symmetric optical system, we use the general 
equations (55.6) after determining first of all the form of the function x in the case under 
consideration. 

Since the bundles of rays are narrow and move in the neighborhood of the optical axis, 
the vectors n, n’ for each bundle are directed almost along this axis. If we choose the optical 
axis as the X axis, then the components n,, n., n;, n; will be small compared with unity. As 
for the components n,, nz; Ny & l and n; can be approximately equal to either +1 or —1. 
In the first case the rays continue to travel almost in their original direction, emerging into 
the space on the other side of the optical system, which in this case is called a /ens. In the 
second the rays change their direction to almost the reverse; such an optical system is called 
a mirror. 

Making use of the smallness of n,, n,, np, n;, we expand the angular eikonal 7 (ny, n;, n,, 
n.) in series and stop at the first terms. Because of the axial symmetry of the whole system, 
y must be invariant with respect to rotations of the coordinate system around the optical 
axis. From this it is clear that in the expansion of z there can be no terms of first order, 
proportional to the first powers of the 1- and z-components of the vectors n and n'; such 
terms would not have the required invariance. The terms of second order which have the 
required property are the squares n^ and n’? and the scalar product n: n'. Thus, to terms of 
second order, the angular eikonal of an axially-symmetric optical system has the form 


y = const + ; (n3 c n2) - f(nyny 4 n, nl) i (n? +0177), (56.1) 


where f, g, h are constants. 

For definiteness, we now consider a lens. so that we set n; x 1; for a mirror, as we shall 
show later, all the formulas have a similar appearance. Now substituting the expression 
(56.1) in the general equations (55.6), we obtain: 


n(x-g)—-fn,— y,  fn,*n(x +h) = y’, 


nf(x—g)—fnl = 2, fn, tniix'+hy)= z. (36:2) 


We consider a homocentric bundle emanating from the point x, y, z; let the point x’, y' z' be 
the point in which all the rays of the bundle intersect after passing through the lens. If the first 
and second pairs of equations (56.2) were independent, then these four equations, for given 
X, Y, Z, X', Y'. z', would determine one definite set of values n. 71, ny, n; that is, there would 
be just one ray starting from the point x, v, z, which would pass through the point x’, v, =’. 
In order that all rays starting from x, y, z shall pass through x’, 1^, z’, it is consequently 
necessary that the equations (56.2) not be independent, that is, one pair of these equations 
must be a consequence of the other. The necessary condition for this dependence is that the 
coefficients in the one pair of equations be proportional to the coefficients of the other pair. 
Thus we must have 


x-9 "mo o} Z 
f{ xh y z 559) 
In particular. 
(x—g)(x' +h) = —f?. (56.4) 


The equations we have obtained give the required connection between the coordinates of 
the image and object for image formation using narrow bundles. 
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The points x = g and x’ = —/ onthe optical axis are called the principal foci of the optical 
system. Let us consider bundles of rays parallel to the optical axis. The source point of such 
rays is, clearly, located at infinity on the optical axis, that is, x = oo. From (56.3) we see that 
in this case, x’ = —h. Thus a parallel bundle of rays, after passage through the optical 
system, intersects at the principal focus. Conversely, a bundle of rays emerging from the 
principal focus becomes parallel after passage through the system. 

In the equation (56.3) the coordinates x and x' are measured from the same origin of co- 
ordinates, lying on the optical axis. It is, however, more convenient to measure the co- 
ordinates of object and image from different origins, choosing them at the corresponding 
principal foci. As positive direction of the coordinates we choose the direction from the 
corresponding focus toward the side to which the light travels. Designating the new co- 
ordinates of object and image by capital letters, we have 


X-2x-4g, X’ =x +h, Y=y, Y'= y, Z =z, Z=. 
The equations of image formation (56.3) and (56.4) in the new coordinates take the form 
XX’ = -f’, (56.5) 


Bac uU Aeron (56.6) 


The quantity f is called the principal focal length of the system. 

The ratio Y’/Y is called the lateral magnification. As for the longitudinal magnification, 
since the coordinates are not simply proportional to each other, it must be written in dif- 
ferential form, comparing the length of an element of the object (along the direction of the 
axis) with the length of the corresponding element in the image. From (56.5) we get for the 


“longitudinal magnification" 

D 2 wy 2 

as = L i (7) i (56.7) 
dX| X Y 

We see from this that even for an infinitely small object, it is impossible to obtain a 
geometrically similar image. The longitudinal magnification is never equal to the transverse 
(except in the trivial case of identical imaging). 

A bundle passing through the point X — f on the optical axis intersects once more at the 
point X' = —f on the axis; these two points are called principal points. From equation 
(56.2) (n, X — fn, = Y, n, X —fn; = Z) it is clear that in this case (X — f, Y= Z —0), we 
have the equations n, = n, n, = nj. Thus every ray starting from a principal point crosses 
the optical axis again at the other principal point in a direction parallel to its original 
direction. 

If the coordinates of object and image are measured from the principal points (and not 
from the principal foci), then for these coordinates č and č’, we have 


č =X +f, č=X-f. 
Substituting in (56.5) it is easy to obtain the equations of image formation in the form 


LES NE. (56.8) 


One can show that for an optical system with small thickness (for example, a mirror or a 
thin lens), the two principal points almost coincide. In this case the equation (56.8) is 
particularly convenient, since in it č and č’ are then measured practically from one and the 
same point. 
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If the focal distance is positive, then objects located in front of the focus (X > 0) are 
imaged erect ( Y'/ Y > 0); such optical systems are said to be converging. If f < 0, then for 
X >0 we have Y '/Y <0, that is, the object is imaged in inverted form; such systems are 
said to be diverging. 

There is one limiting case of image formation which is not contained in the formulas 
(56.8); this is the case where all three coefficients f, g, ^ are infinite (i.e. the optical system has 
an infinite focal distance and its principal foci are located at infinity). Going to the limit of 
infinite f, g, // in (56.4) we obtain 


h ?—gh 
MEL mi gh 

g g 
Since we are interested only in the case where the object and its image are located at finite 
distances from the optical system, f. g, A must approach infinity in such fashion that the 
ratios h/g, (f? —gh)'g are finite. Denoting them, respectively, by a? and fl, we have 


x' 2 aix f. 


For the other two coordinates we now have from the general equation (56.7): 


Finally, again measuring the coordinates x and x' from different origins, namely from some 
arbitrary point on the axis and from the image of this point, respectively, we finally obtain 
the equations of image formation in the simple form 


X' = 2X, Y'= aY, Z = taZ. (56.9) 


Thus the longitudinal and transverse magnifications are constants (but not equal to each 
other). This case of image formation is called telescopic. 

All the equations (56.5) through (56.9), derived by us for lenses, apply equally to mirrors, 
and even to an optical system without axial symmetry, if only the image formation occurs 
by means of narrow bundles of rays traveling near the optical axis. In this, the reference 
points for the x coordinates of object and image must always be chosen along the optical 
axis from corresponding points (principal foci or principal points) in the direction of propaga- 
tion of the ray. In doing this, we must keep in mind that for an optical system not possessing 
axial symmetry, the directions of the optical axis in front of and beyond the system do not lie 
in the same plane. 


PROBLEMS 


1, Find the focal distance for image formation with the aid of two axially-symmetric optical 
systems whose optical axes coincide. 


Solution: Let f, and fy be the focal lengths of the two systems. For each system separately, we 
have 
XX =-f’, X2 X, = —f,?. 

Since the image produced by the first system acts as the object for the second, then denoting by / the 
distance between the rear principal focus of the first system and the front focus of the second, we 
have X; = X,' —l; expressing Y,' in terms of Xi, we obtain 

(QUAS 

toféMX, 
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fy Q £N (ARM 
c Ricette 
from which it is clear that the principal foci of the composite system are located at the points 
X, = —f yl, X; = f? |l and the focal length is 


(to choose the sign of this expression, we must write the corresponding equation for the transverse 
magnification). 


or 


Fic. 8. 


In case / — 0, the focal length f = oo, that is, the composite system gives telescopic image for- 
mation. In this case we have X,' = X(/2/f,)?, that is, the parameter « in the general formula 


(56.9) is a= falf. 


2. Find the focal length for charged particles of a “magnetic lens" in the form of a longitudinal 
homogeneous field in the section of length / (Fig. 8).t 

Solution: The kinetic energy of the particle is conserved during its motion in a magnetic field; 
therefore the Hamilton-Jacobi equation for the reduced action So(r) (where the total action is 


S= — gt + So) is 
V £A =p? 
So =p, 


2 e? 242 
pP = zome =const. 


where 


Using formula (19.4) for the vector potential of the homogeneous magnetic field, choosing the x 
axis along the field direction and considering this axis as the optical axis of an axially-symmetric 
optical system, we get the Hamilton-Jacobi equation in the form: 


AY 2 AY 2 e? 2,2. 2 
(x) (S taa Ho = p*, (1) 


where r is the distance from the x axis, and So is a function of x and r. 
For narrow beams of particles propagating close to the optical axis, the coordinate r is small, so 
that accordingly we try to find So as a power series in r. The first two terms of this series are 


So = px + to(x)r?, (2) 
where c (x) satisfies the equation 
2 
pa (x)+6"-- 4 H? x0 (3) 
4c? 
In region 1 in front of the lens, we have: 
of) = P T 
X—2X 


where x, <0 is a constant. This solution corresponds to a free beam of particles, emerging along 
straight line rays from the point x = x, on the optical axis in region 1. In fact, the action function 


t This might be the field inside a long solenoid, when we neglect the disturbance of the homogeneity of the 
field near the ends of the solenoid. 
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for the free motion of a particle with a momentum p in a direction out from the point x — x, is 


S Vr? t(x- vi zx plx) pe 
=py Xx- Ni) axa phx.: LI—— 
o =P i) =P 1 Xx—3i) 
Similarly, in region 2 behind the lens we write: 
gà = P 
x—Xx,’ 


where the constant x; is the coordinate of the image of the point x;. 
In region 3 inside the lens, the solution of equation (3) is obtained by separation of variables, 
and gives: 


where C is an arbitrary constant. 
The constant C and x; (for given +) are determined by the requirements of continuity of o(x) 


for x = O and x = f: 
p eH p eH eH 
-> = — . —— = —cot { —/+C}. 
Xx, 2¢ oe l=x 2c 2 2cp — e 


Eliminating the constant C from these equations, we find: 


(xi —2) Ga +4) = —f7. 


wheret 
2cp eH 
= 5H 93 P h-g-l, 
2cp 
f- 7 
H sin 2 
E 2c 


§ 57. Image formation with broad bundles of rays 


The formation of images with the aid of narrow bundles of rays, which was considered 
in the previous section, ts approximate; it is the more exact (i.e. the sharper) the narrower 
the bundles. We now go over to the question of image formation with bundles of rays of 
arbitrary breadth. 

In contrast to the formation of an image of an object by narrow beams, which can be 
achieved for any optical system having axial symmetry, image formation with broad beams 
is possible only for specially constituted optical systems. Even with this limitation, as already 
pointed out in § 56, image formation is not possible for all points in space. 

The later derivations are based on the following essential remark. Suppose that all rays. 
starting from a certain point O and traveling through the optical system, intersect again at 
some other point O'. It is easy to see that the optical path length y is the same for all these 
rays. In the neighborhood of each of the points O, O', the wave surfaces for the rays inter- 
secting in them are spheres with centers at O and O', respectively, and, in the limit as we 
approach O and O', degenerate to these points. But the wave surfaces are the surfaces of 
constant phase, and therefore the change in phase along different rays, between their points 
of intersection with two given wave surfaces, is the same. From what has been said, it follows 
that the total change in phase between the points O and O’ is the same (for the different 
rays). 


f The value of / is given with the correct sign. However, to show this requires additional investigation. 
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Let us consider the conditions which must be fulfilled in order to have formation of an 
image of a small line segment using broad beams; the image is then also a small line segment. 
We choose the directions of these segments as the directions of the č and £' axes, with origins 
at any two corresponding points O and O' of the object and image. Let y be the optical path 
length for the rays starting from O and reaching O’. For the rays starting from a point 
infinitely near to O with coordinate d£, and arriving at a point of the image with coordinate 
dé’, the optical path length is y -- dy, where 


ow oy 
= — dě + dé. 
dy Jë St ae" č 
We introduce the “magnification” 
P 
e dé 


as the ratio of the length d£' of the element of the image to the length dé of the imaged 
element. Because of the smallness of the line segment which is being imaged, the quantity « 
can be considered constant along the line segment. Writing, as usual. &y/CG = —ng, 
Oy [OG = n; (ng, n; are the cosines of the angles between the directions of the ray and the 
corresponding axes č and £^, we obtain 


dy = (v; n; —ngd6. 
As for every pair of corresponding points of object and image, the optical path length 


V +d must be the same for all rays starting from the point d£ and arriving at the point dé’. 
From this we obtain the condition: 


nN: —N, = Const. (57.1) 


This is the condition we have been seeking, which the paths of the rays in the optical system 
must satisfy in order to have image formation for a small line segment using broad beams. 
The relation (57.1) must be fulfilled for all rays starting from the point O. 

Let us apply this condition to image formation by means of an axially-symmetric optical 
system. We start with the image of a line segment coinciding with the optical axis (x axis); 
clearly the image also coincides with the axis. A ray moving along the optical axis (n, = 1), 
because of the axial symmetry of the system, does not change its direction after passing 
through it, that is, n; is also 1. From this it follows that const in (57.1) is equal in this case 
to a,—1, and we can rewrite (57.1) in the form 


l-n; 


7 = Oy. 
l-n; 

Denoting by 0 and 6’ the angles subtended by the rays with the optical axis at points of the 
object and image, we have 


D 


.,40 ; 39 
1—n, = 1—cos 0 = 2 sin? 5, 1—n; = 1—cos 6’ = 2 sin? y 
Thus we obtain the condition for image formation in the form 


sin ~ 
2 = 
g^ const = Va. (57.2) 


sin —- 
2 
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Next, let us consider the imaging of a small portion of a plane perpendicular to the optical 
axis of an axially symmetric system; the image will obviously also be perpendicular to this 
axis. Applying (57.1) to an arbitrary segment lying in the plane which is to be imaged, we get: 


æ, sin 0'—sin 0 = const, 


where N and 0° are again the angles made by the beam with the optical axis. For rays 
emerging from the point of intersection of the object plane with the optical axis, and directed 
along this axis (0 = 0), we must have 0’ = 0, because of symmetry. Therefore const is zero, 
and we obtain the condition for imaging in the form 


sin 0 


—— = const = &,. (57.3) 
sin 9 

As for the formation of an image of a three-dimensional object using broad beams, it is 
easy to see that this is impossible even for a small volume, since the conditions (57.2) and 
(57.3) are incompatible. 


§ 58. The limits of geometrical optics 


From the definition of a monochromatic plane wave, its amplitude is the same everywhere 
and at all times. Such a wave is infinite in extent in all directions in space, and exists over the 
whole range of time from — oo to +. Any wave whose amplitude is not constant every- 
where at all times can only be more or less monochromatic. We now take up the question of 
the "degree of non-monochromaticity" of a wave. 

l.et us consider an electromagnetic wave whose amplitude at each point is a function of 
the timc. Let c, be some average frequency of the wave. Then the field of the wave, for 
example the electric field, at a given point has the form Eo (z e" ""*'. This field, although it is of 
course not monochromatic, can be expanded in monochromatic waves, that is, in a Fourier 
integral. The amplitude of the component in this expansion, with frequency w, is propor- 
tional to the integral 

+a 

| Eee "gr. 

“a 
The factor e**7*?" is a periodic function whose average value is zero. If Ey were exactly 
constant, then the integral would be exactly zero, for o ¥ Wo. If, however, Ej(7) is variable, 
but hardly changes over a time interval of order |/|w—qp|, then the integral is almost equal 
to zero, the more exactly the slower the variation of Eo. In order for the integral to be sig- 
nificantly different from zero, it is necessary that E,(t) vary significantly over a time interval 
of the order of 1/|o — o. 

We denote by At the order of magnitude of the time interval during which the amplitude 
of the wave at a given point in space changes significantly. From these considerations, it now 
follows that the frequencies deviating most from wọ, which appear with reasonable intensity 
in the spectral resolution of this wave, are determined by the condition I/|w—wo|~Ar. If 
we denote by Aw the frequency interval (around the average frequency cog) which enters in 
the spectral resolution of the wave, then we have the relation 


AwAt~ d. (58.1) 
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We see that a wave is the more monochromatic (i.e. the smaller Aw) the larger At, i.e. the 
slower the variation of the amplitude at a given point in space. 

Relations similar to (58.1) are easily derived for the wave vector. Let Ax, Ar, Az be the 
orders of magnitude of distances along the X. Y. Z axes, in which the wave amplitude 
changes significantly. At a given time, the field of the wave as a function of the coordinates 
has the form 

E,(r)e*o *, 


where k, is some average value of the wave vector. By a completely analogous derivation to 
that for (58.1) we can obtain the interval Ak of values contained in the expansion of the wave 
into a Fourier integral: 


Ak, Ax ^ |, Ak Ay ~ ]. Ak, Az ~ d. (58.2) 


Let us consider, in particular. a wave which is radiated during a finite time interval. We 
denote by A: the order of magnitude of this interval. The amplitude at a given point in space 
changes signiiicantly during the time Af in the course of which the wave travels completely 
past the point. Because of the relations (58.1) we can now say that the "lack of mono- 
chromaticity” of such a wave. Aco, cannot be smaller than 1/A? (it can of course be larger): 

Aw x (58.3) 
oxs—. À 
At 

Similarly, if Ax, Ai, Az are the orders of magnitude of the extension of the wave in space, 
then for the spread in the values of components of the wave vector, entering in the resolution 
of the wave, we obtain 

l 1 1 
Ak.2 —, | Akh,2., Ak, S—. 58.4) 
T7 Ax > Ay Ta Az ( 

From these formulas it follows that if we have a beam of light of finite width. then the 
direction of propagation of the light in such a beam cannot be strictly constant. Taking the 
X axis along the (average) direction of light in the beam. we obtain 


0 m —— ~, (58.5) 


where 0, is the order of magnitude of the deviation of the beam from its average direction in 
the XY Y plane and 7 is the wavelength. 

On the other hand, the formula (58.5) answers the question of the limit of sharpness of 
optical image formation. A beam of light whose rays, according to geometrical optics, would 
all intersect in a point. actually gives an image not in the form of a point but in the form of a 
spot. For the width A of this spot. we obtain. according to (58.5). 

A b ue 58.6 

“KOTT ee) 

where 0 is the opening angle of the beam. This formula can be applied not only to the image 

but also to the object. Namely, we can state that in observing a beam of light emerging 

from a luminous point. this point cannot be distinguished from a body of dimensions 7/0. 

In this way formula (58.6) determines the limiting resolving power of a microscope. The 

minimum value of A, which is reached for 0 ~ 1, is 4, in complete agreement with the fact 
that the limit of geometrical optics is determined by the wavelength of the light. 


146 THE PROPAGATION OF LIGHT § 59 


PROBLEM 


Determine the order of magnitude of the smallest width of a light beam produced from a parallel 
beam at a distance / from a diaphragm. 

Solution: Denoting the size of the aperture in the diaphragm by d, we have from (58.5) for the 
angle of deflection of the beam (the "diffraction angle"). 4/d, so that the width of the beam is of order 


d- (4'd 4. The smallest value of this quantity ~ V 2L 


§ 59. Diffraction 


The laws of geometrical optics are strictly correct only in the ideal case when the wave- 
length can be considered to be infinitely small. The more poorlv this condition is fulfilled, 
the greater are the deviations from geometrical optics. Phenomenon which are the con- 
sequence of such deviations are called diffraction phenomena. 

Diffraction phenomena can be observed. for example, if along the path of propagation of 
the light t there is an obstacle—an opaque body (we call it a sereen) of arbitrary form or, for 
example, if the light passes through holes in opaque screens. If the laws of geometrical optics 
were strictly satisfied. there would be beyond the screen regions of "shadow" sharply 
delineated from regions where light falls. The diffraction has the consequence that. instead 
of a sharp boundary between light and shadow, there is a quite complex distribution of the 
intensity of the light. These diffraction phenomena appear the more strongly the smaller the 
dimensions of the screens and the apertures in them, or the greater the wavelength. 

The problem of the theory of diffraction consists in determining. for given positions and 
shapes of the objects (and locations of the light sources). the distribution of the light. that is. 
the electromagnetic field over all space. The exact solution of this problem is possible only 
through solution of the wave equation with suitable boundary conditions at the surface of 
the body. these conditions being determined also by the optical properties of the material. 
Such a solution usually presents great mathematical difficulties. 

However, there is an approximate method which for many cases is a satisfactory solution 
of the problem of the distribution of light near the boundary between light and shadow. This 
method is applicable to cases of small deviation from geometrical optics, i.e. when firstly, the 
dimensions of all bodies are large compared with the wavelength (this requirement applies 
both to the dimensions of screens and apertures and also to the distances from the bodies to 
the points of emission and observation of the light): and secondly when there are only small 
deviations of the light from the directions of the rays given by geometrical optics. 

Let us consider a screen with an aperture through which the light passes from given 
sources. Figure 9 shows the screen in profile (the heavy line); the light travels from left to 
right. We denote by i! some one of the components of E or H. Here we shall understand v 
to mean a function only of the coordinates, i.e. without the factor e" "" determining the time 
dependence. Our problem is to determine the light intensity, that is. the field uv, at any 
point of observation P beyond the screen. For an approximate solution of this problem in 
cases where the deviations from geometrical optics are small. we may assume that at the 
points of the aperture the field is the same as it would have been in the absence of the screen. 
In other words, the values of the field here are those which follow directly from geometrical 


+ In what follows, in discussing diffraction we shall talk of the diffraction of light; all these same con- 
siderations also apply, of course, to any electromagnetic wave. 
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optics. At all points immediately behind the screen, the field can be set equal to zero. In this 
the properties of the screen (i.e. of the screen material) obviously play no part. It is also 
obvious that in the cases we are considering, what is important for the diffraction is only the 
shape of the edge of the aperture, while the shape of the opaque screen is unimportant. 

We introduce some surface which covers the aperture in the screen and is bounded by its 
edges (a profile of such a surface is shown in Fig. 9 as a dashed line). We break up this surface 
into sections with arca df. whose dimensions are small compared with the size of the aperture, 
but large compared with the wavelength of the light. We can then consider each of these 
sections through which the light passes as if it were itself a source of light waves spreading 
out on all sides from this section. We shall consider the field at the point P to be the result of 
superposition of the fields produced by all the sections df of the surface covering the aperture. 
(This is called Huygens’ principle.) 

The field produced at the point P by the section dfis obviously proportional to the value w 
of the field at the section df itself (we recall that the field at df is assumed to be the same as it 
would have been in the absence of the screen). In addition, it is proportional to the projection 
df, of the area df on the plane perpendicular to the direction # of the ray coming from the 
light source to df. This follows from the fact that no matter what shape the element df 
has, the same rays will pass through it provided its projection df, remain fixed, and 
therefore its effect on the field at P will be the same. 

Thus the field produced at the point P by the section df is proportional to vdf,. Further- 
more, we must still take into account the change in the amplitude and phase of the wave 
during its propagation from df to P. The law of this change is determined by formula 
(54.3). Therefore u df, must be multiplied by (1/ R)e*? (where R is the distance from df to P. 
and K is the absolute value of the wave vector of the light). and we find that the required 
field is 

ik R 


cdi» 
au df. 


where a is an as yet unknown constant. The field at the point P, being the result of the 
addition of the fields produced by all the elements df, is consequently equal to 


eikR 
uaa fu R dfa (59.1) 
where the integral extends over the surface bounded by the edge of the aperture. In the 


approximation we are considering, this integral cannot, of course, depend on the form of this 
surface. Formula (59.1) is, obviously, applicable not only to diffraction by an aperture 
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in a Screen, but also to diffraction by a screen around which the light passes freely. In that 
case the surface of integration in (59.1) extends on all sides from the edge of the screen. 

To determine the constant a, we consider a plane wave propagating along the X axis; 
the wave surfaces are parallel to the plane YZ. Let u be the value of the field in the YZ plane. 
Then at the point P, which we choose on the X axis, the field is equal to u, = ue'**, On the 
other hand, the field at the point P can be determined starting from formula (59.1), choosing 
as surface of integration, for example, the YZ plane. In doing this, because of the smallness 
of the angle of diffraction, only those points of the YZ plane are important in the integral 
which lie close to the origin, i.e. the points for which y, z « x (x is the coordinate of the 
point P). Then 


and (59.1) gives 


-o -œ 
where u is a constant (the field in the YZ plane); in the factor 1/R, we can put R ~ x = const. 
By the substitution y = ëV 2x]k these two integrals can be transformed to the integral 


+o +o +a = 
Í eS dE = Í cos č? d£ +i | sin č? dë = Ja». 
=æ -2% -0 
and we get 
ikx 2in 
up = aue™ — . 
On the other hand, u, = ue™*, and consequently 
k 
a = I—. 
2ni 


Substituting in (59.1), we obtain the solution to our problem in the form 


ku 
u, = i gh gr. (59.2) 

In deriving formula (59.2), the light source was assumed to be essentially a point, and the 
light was assumed to be strictly monochromatic. The case of a real, extended source, which 
emits non-monochromatic light, does not, however, require special treatment. Because of the 
complete independence (incoherence) of the light emitted by different points of the source, 
and the incoherence of the different spectral components of the emitted light, the total 
diffraction pattern is simply the sum of the intensity distributions obtained from the diffrac- 
tion of the independent components of the light. 

Let us apply formula (59.2) to the solution of the problem of the change in phase of a ray 
on passing through its point of tangency to the caustic (see the end of $ 54). We choose as 
our surface of integration in (59.2) any wave surface, and determine the field u, at a point P, 
lying on some given ray at a distance x from its point of intersection with the wave surface 
we have chosen (we choose this point as coordinate origin O, and as YZ plane the plane 
tangent to the wave surface at the point O). In the integration of (59.2) only a small area of 
the wave surface in the neighborhood of O is important. If the XY Y and XZplanesare chosen 
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to coincide with the principal planes of curvature of the wave surface at the point O, then 
near this point the equation of the surface is 
2 2 
y z 
X => + > 
2R, 2R? 
where R, and R, are the radii of curvature. The distance R from the point on the wave 
surface with coordinates X, y, z, to the point P with coordinates x, 0, 0, is 
; y fl 1 z? /1 1 
R =Ë (x- X} +y +z ~xt5 (i- ~~ |p —(-— =~ |. 
(eX) vy 2A\x R, X R, 
On the wave surface, the field u can be considered constant; the same applies to the factor 
1/R. Since we are interested only in changes in the phase of the wave, we drop coefficients 
and write simply 


* o w(i S 


ONS Ri in’) "T 
| dze . (59.3) 


1 . eik* 2 
u~- | "dfa m — | dye 
i i 


-æ =- 00 

The centers of curvature of the wave surface lie on the ray we are considering, at the 
points x = R, and x = R,; these are the points where the ray is tangent to the caustic. 
Suppose R, < R,. For x < R;, the coefficients of i in the exponentials appearing in the two 
integrands are positive, and each of these integrals is proportional to (1+/). Therefore on 
the part of the ray before its first tangency to the caustic, we have u, ~ e™*. For Ra < x < Ry, 
that is, on the segment of the ray between its two points of tangency, the integral over y is 
proportional to 1 + i, but the integral over z is proportional to 1 —i, so that their product does 
not contain i. Thus we have here u, ~ — je^* = e/6*7 07? that is, as the ray passes in the 
neighborhood of the first caustic, its phase undergoes an additional change of — 7/2. 
Finally, for x > R,, we have u, ~ —e'* = e'** ^", that is, on passing in the neighborhood 
of the second caustic, the phase once more changes by — 7/2. 


PROBLEM 


Determine the distribution of the light intensity in the neighborhood of the point where the ray 
is tangent to the caustic. 

Solution: To solve the problem, we use formula (59.2), taking the integral in it over any wave 
surface which is sufficiently far from the point of tangency of the ray to the caustic. In Fig. 10, ab 
is a section of this wave surface, and a'b’ is a section of the caustic; a'b’ is the evolute of the curve 


Fic. 10. 


150 THE PROPAGATION OF LIGHT § 59 


ab. We are interested in the intensity distribution in the neighborhood of the point O where the ray 
QO is tangent to the caustic; we assume the length D of the segment QO of the ray to be large. We 
denote by x the distance from the point O along the normal to the caustic, and assume positive 
values x for points on the normal in the. direction of the center of curvature. 

The integrand in (59.2) is a function of the distance R from the arbitrary point Q’ on the wave 
surface to the point P. From a well-known property of the evolute, the sum of the length of the seg- 
ment Q'O' of the tangent at the point O' and the length of the arc OO’ is equal to the length QO 
of the tangent at the point O. For points O and O' which are near to each other we have OO’ = 00 
(e is the radius of curvature of the caustic at the point O). Therefore the length Q'O' = D — 6o. The 
distance Q'O (along a straight linc) is approximately (the angle 0 is assumed to be small) 


3 
QO-c-Q'O'-osn0- D—0e-esin0 = D—o = 


Finally, the distance R = Q'P is equal to R= Q'O —x sin 0 = Q'O — x0, that is, 
Rz D—x0—'00°. 
Substituting this expression in (59.2), we obtain 
VENTE ^ k 
Up ~ | e 6 dü-— a| cos (os - ej dO 
e o 


(the slowly varying factor 1/D in the integrand is unimportant compared with the exponential 
factor, so we assume it constant). Introducing the new integration variable č = (ko/2)! ? 0, we get 


512N1/3 
(ey 
e 
where @(r) is the Airy function.t 


For the intensity / ~ |u,!?, we write: 


ne 7 2k? TM m a 
e p. 


(concerning the choice of the constant factor, cf. below). 
For large positive values of x, we have from this the asymptotic formula 


A 4x3? Jk? 
Ix a exp | 3 PES 


t The Airy function P(r) is defined as 


l e 
M(t) = —= | cos | -- čt | de. (1 
Vn 3 
E 
(see Quantum Mechanics, Mathematical Appendices.§ b). For large positive values of the argument, the 
asymptotic expression for (t) is 
t 2 3;2 $ 
Mr) z zs exo (— $i ) (2) 


that is, D(/) goes exponentially to zero. For large negative values of r, the function ®(7) oscillates with 
decreasing amplitude according to the law: 


o ~ dol " ey 342 Ld GB 
(D) 3 es quens. 
The Airy function is related to the MacDonald function (modified Hankel function) of order 1/3: 
$1) = V tan Ki, GU). 4 


Formula (2) corresponds to the asymptotic expansion of K,(t): 


Kt) = NE 
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that is, the intensity drops exponentially (shadow region). For large negative values of x, we have 


2A 2(—-x8?. JIk? om 
z —== sin? HGA 
INg { 3 NE 4 


that is, the intensity oscillates rapidly; its average value over these oscillations is 


From this meaning of the constant A is clear—it is the intensity far from the caustic which would be 
obtained from geometrical optics neglecting diffraction effects. 
The function (7) attains its largest value, 0.949, for t = — 1.02; correspondingly, the maximum 
intensity is reached at x(2&2/9)! /? = — 1.02, where 
I = 2.03 AK13 716, 


At the point where the ray is tangent to the caustic (x = 0), we have J = 0.89 Ak 1? 9-1/6 [since 
(0) = 0.629]. 

Thus near the caustic the intensity is proportional to &1^, that is, to 4^1? (A is the wavelength). 
For 2-0, the intensity goes to infinity, as it should (see § 54). 


§ 60. Fresnel diffraction 


If the light source and the point P at which we determine the intensity of the light are 
located at finite distances from the screen, then in determining the intensity at the point P, 
only those points are important which lie in a small region of the wave surface over which we 
integrate in (59.2)—the region which lies near the line joining the source and the point P. 
In fact, since the deviations from geometrical optics are small, the intensity of the light 
arriving at P from various points of the wave surface decreases very rapidly as we move 
away from this line. Diffraction phenomena in which only a small portion of the wave 
surface plays a role are called Fresnel diffraction phenomena. 

Let us consider the Fresnel diffraction by a screen. From what we have just said, for a 
given point P only a small region at the edge of the screen is important for this diffraction. 
But over sufficiently small regions, the edge of the screen can always be considered to be a 
straight line. We shall therefore, from now on, understand the edge of the screen to mean 
just such a small straight line segment. 

We choose as the X Y plane a plane passing through the light source Q (Fig. 11) and 
through the line of the edge of the screen. Perpendicular to this, we choose the plane XZ so 
that it passes through the point Q and the point of observation P, at which we try to deter- 
mine the light intensity. Finally, we choose the origin of coordinates O on the line of the edge 
of the screen, after which the positions of all three axes are completely determined. 


Fic. 11. 
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Let the distance from the light source Q to the origin be D,. We denote the x-coordinate 
of the point of observation P by D,, and its z-coordinate, i.e. its distance from the XY 
plane, by d. According to geometrical optics, the light should pass only through points 
lying above the X Y plane; the region below the X Y plane is the region which according to 
geometrical optics should be in shadow (region of geometrical shadow). 

We now determine the distribution of light intensity on the screen near the edge of the 
geometrical shadow, i.e. for values of d small compared with D, and D,. A negative d 
means that the point P is located within the geometrical shadow. 

As the surface of integration in (59.2) we choose the half-plane passing through the line 
of the edge of the screen and perpendicular to the X Y plane. The coordinates x and y of 
points on this surface are related by the equation x = y tan a (a is the angle between the line 
of the edge of the screen and the Y axis), and the z-coordinate is positive. The field of the 
wave produced by the source Q, at the distance R, from it, is proportional to the factor 
e*®a. Therefore the field v on the surface of integretion is proportional to 


u ~ exp [ikv y? +z? t (D, 4 y tan a)?. 
In the integral (59.2) we must now substitute for R, 

R= Vy! (z-dy +(D,—y tan a). 
The slowly varying factors in the integrand are unimportant compared with the exponential. 
Therefore we may consider 1/R constant, and write dy dz in place of df,. We then find that 
the field at the point P is 


+o œ 
up ~ f fexp (ik(N (D, + y tan à? +y +2? 


-o00 


-X(D,—ytangy-(z—d)4y))dydz. (60.1) 
As we have already said, the light passing through the point P comes mainly from points 
of the plane of integration which are in the neighborhood of O. Therefore in the integral 


(60.1) only values of y and z which are small (compared with D, and D,) are important. For 
this reason we can write 


2 2 


2 9 > “+ 2° 
(D, +y tana) +y o zie D +> +y tana, 
2D, 
(z—dyt+y* 
V (D, -y tana + (z - dY + y! =D, + p — y tana. 
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We substitute this in (60.1). Since we are interested only in the field as a function of the 
distance d, the constant factor exp {ik(D,+D,)} can be omitted; the integral over y also 
gives an expression not containing d, so we omit it also. We then find 


1 i 
u~ | exp {ik 6 Peg C79) | de 
4 P 
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This expression can also be written in the form 


. Je ; Ds Sj 
2 4 21D, D, D, 
up ^ exp {ik xB} | exp ik P 4 P 


0 


dz. (60.2) 
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The light intensity is determined by the square of the field, that is, by the square modulus 
up|. Therefore, when calculating the intensity, the factor standing in front of the integral 
is irrelevant, since when multiplied by the complex conjugate expression it gives unity. An 
obvious substitution reduces the integral to 


up~ f e" dr, (60.3) 
where 
—H- 
=d jae uc. 60.4 
" 2D,(D, +D,) GORI 


Thus, the intensity / at the point P is: 


ae in? Lao] C(w^ IN Stay? ly 
E fe dq =3 1 (w )*5 T (Ww +5 3 (60.5) 
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2v: _ vz 
2 : P 
C(z)= ,/ - [ cos mdp, S(z)= ./ = | sinn? dq 
0n rid 
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are called the Fresne/ integrals. Formula (60.5) solves our problem of determining the light 
intensity as a function of d. The quantity 7, is the intensity in the illuminated region at 
points not too near the edge of the shadow; more precisely, at those points with w > | 
(C(oo) = S(oo) = 4 in the limit w — oc). 

The region of geometrical shadow corresponds to negative w. It is easy to find the 
asymptotic form of the function / (w) for large negative values of w. To do this we proceed 
as follows. Integrating by parts, we have 


where 


oc [ze] 


T 1 zer] an 

in? dn = — —--—. ei” a in 
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Integrating by parts once more on the right side of the equation and repeating this process, 
we obtain an expansion in powers of 1/]w|: 


f x Mn 1 1 
in iw? 
f e" dn=e |- zw] + aj] —.. 4. (60.6) 
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Although an infinite series of this type does not converge, nevertheless, because the suc- 
cessive terms decrease very rapidly for large values of |w], the first term already gives a good 
representation of the function on the left for sufficiently large |w| (such a series is said to be 
asymptotic). Thus, for the intensity /(w), (60.5), we obtain the following asymptotic formula, 
valid for large negative values of w: 
Io 
l=-~-5. 60.7 

4nw? D 
We see that in the region of geometric shadow, far from its edge, the intensity goes to zero 
as the inverse square of the distance from the edge of the shadow. 
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We now consider positive values of w, that is, the region above the X Y plane. We write 


9» to -w [A 


-w — 00 "o w 
For sufficiently large w, we can use an asymptotic representation for the integral standing 
on the right side of the equation, and we have 


20 


T ANO 
Í e" dn = (14 i) NES -— em, (60.8) 
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Substituting this expression in (60.5). we obtain 


" . " a 
rj-siniw—— 
1 4 

P=I9\1+ ,/-—--——— }. (60.9) 
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Thus in the illuminated region, far from the edge of the shadow, the intensity has an infinite 
sequence of maxima and minima. so that the ratio 7/7, oscillates on both sides of unity. 
With increasing w, the amplitude of these oscillations decreases inversely with the distance 
from the edge of the geometric shadow, and the positions of the maxima and minima steadily 
approach one another. 

For small w. the function 7(w) has qualitatively this same character (Fig. 12). In the region 
of the geometric shadow. the intensity decreases monotonically as we move away from the 
boundary of the shadow. (On the boundary itself, 7/7) = 1.) For positive w, the intensity has 
alternating maxima and minima. At the first (largest) maximum. 7/7, = 1.37. 


§ 61. Fraunhofer diffraction 


Of special interest for physical applications are those diffraction phenomena which occur 
when a plane parallel bundle of rays is incident on a screen. As a result of the diffraction, 
the beam ceases to be parallel. and there is light propagation along directions other than the 
initial one. Let us consider the problem of determining the distribution over direction of the 
intensity of the diffracted light at large distances beyond the screen (this formulation of the 
problem corresponds to Fraunhofer diffraction). Here we shall again restrict ourselves to the 
case of small deviations from geometrical optics. i.e. we shall assume that the angles of 
deviation of the rays from the initial direction (the diffraction angles) arc small. 


d. 
la 


Geometricai 
shadow 


Iunnated 
1eqion 


§ 61 FRAUNHOFER DIFFRACTION 155 


This problem can be solved by starting from the general formula (59.2) and passing to the 
limit where the light source and the point of observation are at infinite distances from the 
screen. A characteristic feature of the case we are considering is that, in the integral which 
determines the intensity of the diffracted light, the whole wave surface over which the integral 
is taken is important (in contrast to the case of Fresnel diffraction, where only the portions 
of the wave surface near the edge of the screens are important). 

However, it is simpler to treat this problem anew, without recourse to the general formula 
(59.2). 

Let us denote by wp the field which would exist beyond the screens if geometrical optics 
were rigorously valid. This field is a plane wave, but its cross section has certain regions 
(corresponding to the “shadows” of opaque screens) in which the field is zero. We denote 
by S the part of the plane cross-section on which the field ug is different from zero; since 
each such plane is a wave surface of the plane wave, ug = const over the whole surface S. 

Actually, however, a wave with a limited cross-sectional area cannot be strictly plane 
(see § 58). In its spatial Fourier expansion there appear components with wave vectors 
having different directions, and this is precisely the origin of the diffraction. 

Let us expand the field ug into a two-dimensional Fourier integral with respect to the co- 
ordinates y, z in the plane of the transverse cross-section of the wave. For the Fourier 
components, we have: 


wo | [me ttsas (61.1) 


where the vectors q are constant vectors in the y, z plane; the integration actually extends 
only over that portion S of the y, z plane on which wp is different from zero. If k is the wave 
vector of the incident wave, the field component uae ** gives the wave vector k' 2 k-4q. 
Thus the vector q = k' —k determines the change in the wave vector of the light in the diffrac- 
tion. Since the absolute values k = k’ = w/c, the small diffraction angles 0,, 0, in the xj- and 
xz-planes are related to the components of the vector q by the equations 


dom 6s. q=" ds (61.2) 
c c 

For small deviations from geometrical optics, the components in the expansion of the 
field uy can be assumed to be identical with the components of the actual field of the dif- 
fracted light, so that formula (61.1) solves our problem. 

The intensity distribution of the diffracted light is given by the square ||? as a function 
of the vector q. The quantitative connection with the intensity of the incident light is 
established by the formula 


f , dq, dq- 
| f ayaz =Í] Jual? Gat (61.3) 


T The criteria for Fresnel and Fraunhofer diffraction are easily found by returning to formula (60.2) and 
applying it, for example, to a slit of width a (instead of to the edge of an isoiated screen). The integration 
over z in (60.2) should then be taken between the limits from 0 to a. Fresnel diffraction corresponds to the 
case when the term containing z? in the exponent of the integrand is important, and the upper limit of the 
integral can be replaced by cc. For this to be the case, we must have 


ro DV. 
Fd bel etr 
A (5. 5)" 


On the other hand, if this inequality is reversed, the term in =? can be dropped; this corresponds to the case of 
Fraunhofer diffraction. 
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[compare (49.8)]. From this we see that the relative intensity diffracted into the solid angle 


do = d0,d0, is given by 
lu^ dq,dq._ (oV 
ul Quy | (s) 

Let us consider the Fraunhofer diffraction from two screens which are “complementary” : 
the first screen has holes where the second is opaque and conversely. We denote by u“? and 
uw? the field of the light diffracted by these screens (when the same light is incident in both 
cases). Since u4! and uj? are expressed by integrals (61.1) taken over the surfaces of the 
apertures in the screens, and since the apertures in the two screens complement one another 
to give the whole plane, the sum u, P? +u‘? is the Fourier component of the field obtained 
in the absence of the screens, i.e. it is simply the incident light. But the incident light is a 
rigorously plane wave with definite direction of prepagation, so that u, C +u‘? = 0 for all 
nonzero values of q. Thus we have ae? = =u”, or for the corresponding intensities, 

jua P|? = [u]? for q # 0. (61.5) 

This means that complementary screens give the same distribution of intensity of the 
diffracted light (this is called Babinet’s principle). 

We call attention here to one interesting consequence of the Babinet principle. Let us 
consider a blackbody, i.e. one which absorbs completely all the light falling on it. According 
to geometrical optics, when such a body is illuminated, there is produced behind it a region 
of geometrical shadow, whose cross-sectional area is equal to the area of the body in the 
direction perpendicular to the direction of incidence of the light. However, the presence of 
diffraction causes the light passing by the body to be partially deflected from its initial 
direction. As a result, at large distances behind the body there will not be complete shadow 
but, in addition to the light propagating in the original direction, there will also be a certain 
amount of light propagating at small angles to the original direction. It is easy to determine 
the intensity of this scattered light. To do this, we point out that according to Babinet’s 
principle, the amount of light deviated because of diffraction by the body under considera- 
tion is equal to the amount of light which would be deviated by diffraction from an aperture 
cut in an opaque screen, the shape and size of the aperture being the same as that of the 
transverse section of the body. But in Fraunhofer diffraction from an aperture all the light 
passing through the aperture is deflected. From this it follows that the total amount of light 
scattered by a blackbody is equa! to the amount of light falling on its surface and absorbed 
by it. 


2 
do. (61.4) 
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PROBLEMS 


1. Calculate the Fraunhofer diffraction of a plane wave normally incident on an infinite slit 
(of width 2a) with parallel sides cut in an opaque screen. 


Solution: We choose the plane of the slit as the yz plane, with the z axis along the slit (Fig. 13 
shows a section of the screen). For normally incident light, the plane of the slit is one of the wave 
surfaces, and we choose it as the surface of integration in (61.1). Since the slit is infinitely long, the 
light is deflected only in the xy plane [since the integral (61.1) becomes zero for q, +4 0]. 

Therefore the field should be expanded only in the y coordinate: 
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The intensity of the diffracted light in the angular range d@ is 

I ? di in? ka8 

ai = ned. do SIn OD jy; 

2a iuj 2n nak 82 

where k = w/c, and Is is the total intensity of the light incident on the slit. 
dl/d8 as a function of diffraction angle hes the form shown in Fig. 14. As 8 increases toward 

either side from 0 — 0, the intensity goes through a series of maxima with rapidly decreasing 
height. The successive maxima are separated by minima at the points 0 = nz/ka (where n is an 
integer); at the minima, the intensity falls to zero. 


sin? x 
xl 
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2. Calculate the Fraunhofer diffraction by a diffraction grating—a plane screen in which are cut 
a series of identical parallel slits (the width of the slits is 2a, the width of opaque screen between 
neighboring slits is 2b, and the number of slits is N ). 


Solution: We choose the plane of the grating as the yz plane, with the z axis parallel to the slits. 
Diffraction occurs only in the xy plane, and integration of (61.1) gives: 


N-1 ] —e-?2iNad 
a , -2inqd — y’ 
ug=ugy e ki Clg -2iqd* 
n=0 —e 


where d = a+b, and u; is the result of the integration over a single slit. Using the results of problem 
1, we get: 
dp = 10% sin Nqd \? {sin ga PrE Io sin Nk 6d ? sin? Rog do 
Nz singd qa Nnaak V sin k 0d 0 
(Jo is the total intensity of the light passing through all the slits). 
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For the case of a large number of slits (N--> oc), this formula can be written in another form. For 
values g = zn/d, where n is an integer, d//dg has a maximum; near such a maximum (i.e. for 


qd = nn >e, with c small) 
sin qa M? sin? Ne 
= E m d 
al oa ( qa ) nN “1 


But for N--»oc, we have the formulat 
. sin? Nx 
lim 


= (x). 
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We therefore have, in the neighborhood of each maximum: 


À 2 
deep (= 2 &(x) de, 
qa 
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i.e., in the limit the widths of the maxima are infinitely narrow and the total light intensity in the 
n'th maximum is 
m Y 
jop L Doa) 
ra n? 
3. Find the distribution of intensity over direction for the diffraction of light which is incident 
normal to the plane of a circular aperture of radius a. 


Solution: We introduce cylindrical coordinates z, r, with the z axis passing through the center 
of the aperture and perpendicular to its plane. Jt is obvious that the diffraction is symmetric about 
the z axis, so that the vector q has only a radial component q, = q = k0. Measuring the angle ¢ 
from the direction q, and integrating in (61.1) over the plane of the aperture, we find: 


aon a 
Ug = No | e~r cos? pr dd dr — 2nug | Jo(gr)r dr, 
ò o ò 


where Jo is the zero’th order Bessel function. Using the well-known formula 
a 


Í Jo(gr) rdr = ; J.(ag), 


0 
we then have 


Ug = 2n "3 Ji(aq), 


and according to (61.4) we obtain for the intensity of the light diffracted into the element of solid 
angle do: 

_ , JX(ak0) 
dI = Ip SN 


where /o is the total intensity of the light incident on the aperture. 


do, 


+ For x # Othe function on the left side of the equation is zero, while according to a well-known formula 


of the theory of Fourier series, 
m2 
lim (1 | fon 95729 ax) = f0. 
No ME Nx 


From this we see that the properties of this function actually coincide with those of the ó-function (see the 
footnote on p. 70). 


CHAPTER 8 


THE FIELD OF MOVING CHARGES 


§ 62. The retarded potentials 


In Chapter 5 we studied the constant field, produced by charges at rest, and in Chapter 6, 
the variable field in the absence of charges. Now we take up the study of varying fields in the 
presence of arbitrarily moving charges. 

We derive equations determining the potentials for arbitrarily moving charges. This 
derivation is most conveniently done in four-dimensional form, repeating the derivation at 
the end of § 46, with the one change that we use the second pair of Maxwell equations in the 
form (30.2) 


OF“ — 4n i 
xt cl 
The same right-hand side also appears in (46.8), and after imposing the Lorentz condition 
A! 14 
— =0, ie 1 96 +div A — 0, (62.1) 
àx' c Ot 
on the potentials, we get 
04 An, 
lO =f. 62.2 
ôx, ôx c J (62.2) 


This is the equation which determines the potentials of an arbitrary electromagnetic field. 
In three-dimensional form it is written as two equations, for A and for ¢: 


1 07A 4n 
AA- = L-—-—j 62.3 
i n (62.3) 
12? 
Ab- = = ? = —4mng. (62.4) 
c^ oat 


For constant fields, these reduce to the already familiar equations (36.4) and (43.4), and for 
variable fields without charges, to the homogeneous wave equation. 

As we know, the solution of the inhomogeneous linear equations (62.3) and (62.4) can be 
represented as the sum of the solution of these equations without the right-hand side, and a 
particular integral of these equations with the right-hand side. To find the particular solution, 
we divide the whole space into infinitely small regions and determine the field produced by 
the charges located in one of these volume elements. Because of the linearity of the field 
equations, the actual field will be the sum of the fields produced by all such elements. 

The charge de in a given volume element is, generally speaking, a function of the time. 
If we choose the origin of coordinates in the volume element under consideration, then the 

159 
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charge density is ọ = de(t) ó(R), where R is the distance from the origin. Thus we must 
solve the equation 


12 
^$- 3 a; = —4nde(t) 5(R). (62.5) 
Everywhere, except at the origin, 6(R) = 0, and we have the equation 
1 6?ó 
Aó—-——;-0 ; 
$- api (62.6) 


It is clear that in the case we are considering $ has central symmetry, i.e., $ is a function 
only of R. Therefore if we write the Laplace operator in spherical coordinates, (62.6) reduces 
to 


0. 


1 8 gi d o 
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To solve this equation, we make the substitution ¢ = y(R, 1)/R. Then, we find for x 


But this is the equation of plane waves, whose solution has the form (see $ 47): 


rej (i-2) tf (5) 


Since we only want a particular solution of the equation, it is sufficient to choose only one 
of the functions f, and f2. Usually it turns out to be convenient to take f; = 0 (concerning 
this, see below). Then, everywhere except at the origin, @ has the form 


Go. 
x(t- - 
= MET. 62.7) 
$ xo (62. 

So far the function x is arbitrary; we now choose it so that we also obtain the correct 
value for the potential at the origin. In other words, we must select x so that at the origin 
equation (62.5) is satisfied. This is easily done noting that as R — 0, the potential increases 
to infinity, and therefore its derivatives with respect to the coordinates increase more rapidly 
than its time derivative. Consequently as R — 0, we can, in equation (62.5), neglect the term 
(1/c2)/(0?/0t?) compared with Ad. Then (62.5) goes over into the familiar equation (36.9) 
leading to the Coulomb law. Thus, near the origin, (62.7) must go over into the Coulomb 
law, from which it follows that y(t) = de(t), that is, 


“(2 


que 


From this it is easy to get to the solution of equation (62.4) for an arbitrary distribution of 
charges o(x, y, z, t). To do this, it is sufficient to write de = o dV (dV is the volume element) 
and integrate over the whole space. To this solution of the inhomogeneous equation (62.4) 
we can still add the solution $$ of the same equation without the right-hand side. Thus, 


§ 63 THE LIENARD—WIECHERT POTENTIALS 161 


the general solution has the form: 


o(r, t) = IE. o (r. Tm 5) dV' +o, (62.8) 


R=r-r’, dV’ = dx' dy’ dz 
where 
r=(x, y, z) r =x, y’, zz) 
R is the distance from the volume element dV to the “field point"' at which we determine the 
potential. We shall write this expression briefly as 


o= | SEO dy + do, (62.9) 
where the subscript means that the quantity o is to be taken at the time ! — (R/c), and the 
prime on dV has been omitted. 

Similarly we have for the vector potential: 


a= [iwo av+Ao, (62.10) 
C. R 
where Ag is the solution of equation (62.3) without the right-hand term. 

The potentials (62.9) and (62.10) (without ġo and Aj) are called the retarded potentials. 

In case the charges are at rest (i.e. density o independent of the time), formula (62.9) goes 
over into the well-known formula (36.8) for the electrostatic field ; for the case of stationary 
motion of the charges, formula (62.10), after averaging. goes over into formula (43.5) for the 
vector potential of a constant magnetic field. 

The quantities A, and $ in (62.9) and 62.10) are to be determined so that the conditions 
of the problem are fulfilled. To do this it is clearlv sufficient to impose initial conditions, that 
is, to fix the values of the field at the initial time. However we do not usually have to deal 
with such initial conditions. Instead we are usually given conditions at large distances from 
the system of charges throughout all of time. Thus, we may be told that radiation is incident 
on the system from outside. Corresponding to this, the field which is developed as a result 
of the interaction of this radiation with the system can differ from the external field only by 
the radiation originating from the system. This radiation emitted by the system must, at large 
distances, have the form of waves spreading out from the system, that is, in the direction of 
increasing R. But precisely this condition is satisfied by the retarded potentials. Thus these 
solutions represent the field produced by the system, while $9 and Ag must be set equal to 
the external field acting on the system. 


§ 63. The Lienard-Wiechert potentials 


Let us determine the potentials for the field produced by a charge carrying out an assigned 
motion along a trajectory r = r(f). 

According to the formulas for the retarded potentials, the field at the point of observation 
P(x, y, z) at time t is determined by the state of motion of the charge at the earlier time t’, 
for which the time of propagation of the light signal from the point ro{t’), where the charge 
was located,to the field point P just coincides with the difference t—7'. Let R(t) = r—ro(t) 
be the radius vector from the charge e to the point P; like ro(/) it is a given function of the 
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time. Then the time ¢’ is determined by the equation 


4 (63.1) 


For each value of t this equation has just one root t.t 

In the system of reference in which the particle is at rest at time rt’, the potential at the 
point of observation at time 7 is just the Coulomb potential, 

e 
Rwy 
The expressions for the potentials in an arbitrary reference system can be found directly 


by finding a four-vector which for v = 0 coincides with the expressions just given for @ and 
A. Noting that, according to (63.1), $ in (63.2) can also be written in the form 


b= A=0. (63.2) 


e 
$ = cary 
we find that the required four-vector is: 
t ul 
Av= Ryu” (63.3) 


where uw is the four-velocity of the charge, R* = [e(t—t'), r—r'], where x’. y’, 2’, t’ are 
related by the equation (63.1), which in four-dimensional form is 


R,R*=0. (63.4) 


Now once more transforming to three-dimensional notation, we obtain, for the potentials 
of the field produced by an arbitrarily moving point charge, the following expressions: 


tho Seta EN (63.5) 


a a 


where R is the radius vector, taken from the point where the charge is located to the point 
of observation P, and all the quantities on the right sides of the equations must be evaluated 
at the time t’, determined from (63.1). The potentials of the field, in the form (63.5), are 
called the Lienard-Wiechert potentials. 

To calculate the intensities of the electric and magnetic fields from the formulas 


ó- 


10A 
E= em — grad $, H = curi A, 


we must differentiate @ and A with respect to the coordinates x, y, z of the point, and the 
time t of observation. But the formulas (63.5) express the potentials as functions of t’, and 
only through the relation (63.1) as implicit functions of x, y, z, t. Therefore to calculate the 


t This point is obvious but it can be verified directly. To do this we choose the field point P and the time 
of observation f as the origin O of the four-dimensional coordinate system and construct the light cone (§ 2) 
with its vertex at O. The lower half of the cone, containing the absolute past (with respect to the event O), is 
the geometrical locus of world points such that signals sent from them reach O. The points in which this 
hypersurface intersects the world line of the charge are precisely the roots of (63.1). But since the velocity of a 
particle is always less than the velocitv of light, the inclination of its world line relative to the time axis is 
everywhere less than the slope of the light cone. It then follows that the world line of the particle can inter- 
sect the lower half of the light cone in only one point. 
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required derivatives we must first calculate the derivatives of t’. Differentiating the relation 
R(t') = c(t—t') with respect to t, we get 


ôR Rr R-vor ( a) 
E __ =elt— 


ô wa R ót ét 
(The value of OR/Ot' is obtained by differentiating the identity R? = R? and substituting 
OR(t )/0t' = — v(t’). The minus sign is present because R is the radius vector from the charge 
e to the point P, and not the reverse.) 
Thus, 

ôr ] 

L mL. 63.6 

ôt 1 YR (63.6) 

|: Re 


Similarly, differentiating the same relation with respect to the coordinates, we find 


1 1/6R R 
grad i’ = — - grad R(t’) = — - e grad t' 4 - ) 
c € ACt R 


so that 


grad t! = — ———— —. (63.7) 


With the aid of these formulas, there is no difficulty in carrying out the calculation of the 
fields E and H. Omitting the intermediate calculations, we give the final results: 


v 
1-5 
Ese 5 ln. — ê (r-* ; 
e eas (R DLE ; go Rx {(R LR xv, (63.8) 
rR- =") c R- TT) 


- z RxE. (63.9) 


Here, * = dv/dt’; all quantities on the right sides of the equations refer to the time t’. It is 
interesting to note that the magnetic field turns out to be everywhere perpendicular to the 
electric. 

The electric field (63.8) consists of two parts of different type. The first term depends only 
on the velocity of the particle (and not on its acceleration) and varies at large distances like 
1/R?. The second term depends on the acceleration, and for large R it varies like 1/R. Later 
(§ 66) we shall see that this latter term is related to the electromagnetic waves radiated by the 
particle. 

As for the first term, since it is independent of the acceleration it must correspond to the 
field produced by a uniformly moving charge. In fact, for constant velocity the difference 


Ry — ` Ry = Ry =w=’) 


is the distance R, from the charge to the point of observation at precisely the moment of 
observation. It is also easy to show directly that 


1 Q I 2 o 2 
Re= z Reys Rr — 3 (Yx R) =R, 1— 5 sin 0, 
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where 0, is the angle between R, and v. Consequently the first term in (63.8) is identical 
with the expression (38.8). 


PROBLEM 


Derive the Lienard-Wiechert potentials by integrating (62.9-10). 
Solution: We write formula (62.8) in the form: 


or, t) = ife if) s( -t i! Ir -n)jea 
ir —ri c 


(and similarly for A(r, /)), introducing the additional delta function and thus eliminating the 
irnplicit arguments in the function o. For a point charge, moving in a trajectory r = ro(t), we have: 


e(r', t) = eó[r —ro(1)]. 


Substituting this expression and integrating over dV’, we get: 


dt 1 | 
é(r, t) =e ll kr] ô I r-r]. 


The t integration is done using the formula 

à(r—1) 
Fit’) 
[where ¢’ is the root of F(t’) = 0], and gives formula (63.5). 


a[F()] = 


§ 64. Spectral resolution of the retarded potentials 


The field produced by moving charges can be expanded into monochromatic waves. The 
potentials of the different monochromatic components of the field have the form $, e^ '^', 
A „e` '*', The charge and current densities of the system of charges producing the field can 
also be expanded in a Fourier series or integral. It is clear that each Fourier component of o 
and jis responsible for the creation of the corresponding monochromatic component of the 
field. 

In order to express the Fourier components of the field in terms of the Fourier components 
of the charge density and current, we substitute in (62.9) for $ and o respectively, ¢,,e7' 
and g,,e7''. We then obtain 

` R 
e io(t z) 


$e = fos -R dV. 


Factoring e^" and introducing the absolute value of the wave vector k = w/c, we have: 
e^ 
= — dV, 64.1 
bo= | 00% (64.1) 
Similarly, for A,, we get 


A, = f je =z dv. (64.2) 
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We note that formula (64.1) represents a generalization of the solution of the Poisson 
equation to a more general equation of the form 


A, t kó, = —4n0, (64.3) 

(obtained from equations (62.4) for o, @ depending on the time through the factor e^ /**). 

If we were dealing with expansion into a Fourier integral, then the Fourier components 
of the charge density would be 


+a 
Qo | eel dt. 
J 


Substituting this expression in (64.1), we get 


to 
$a = Í j n gem IV dt. (64.4) 


We must still go over from the continuous distribution of charge density to the point charges 
whose motion we are actually considering. Thus, if there is just one point charge, we set 


o = eó[r —ro(1)], 
where ro(t) is the radius vector of the charge, and is a given function of the time. Substituting 


this expression in (64.4) and carrying out the space integration [which reduces to replacing 
r by ro(t )], we get: 


oo 


- 
by =e I [n C (64.5) 


where now R(t) is the distance from the moving particle to the point of observation. 
Similarly we find for the vector potential: 
e v(t). 
A, =- | a eel ROI dt, 64.6 
^i [moe (64.6) 


= pG 
where v = F(t) is the velocity of the particle. 

Formulas analogous to (64.5), (64.6) can also be written for the case where the spectral 
resolution of the charge and current densities contains a discrete series of frequencies. Thus, 
for a periodic motion of a point charge (with period T = 27/w ) the spectral resolution of the 
field contains only frequencies of the form nwọ, and the corresponding components of the 


vector potential are 
T 


A E £ v(t) einolt + RONI gy (64.7) 


cT J R(t) 
0 


(and similarly for $,). In both (64.6) and (64.7) the Fourier components are defined in 
accordance with $ 49. 


PROBLEM 


Find the expansion in plane waves of the field of a charge in uniform rectilinear motion. 

Solution: We proceed in similar fashion to that used in $ 51. We write the charge density in the 
form 0 =e d(r—vt), where v is the velocity of the particle. Taking Fourier components of the 
equation Dé = —4ze ó(r —vt), we find (Dé) = —4nee v Vt, 
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On the other hand, from 


i d3k 
= tker mis 
g= | etos 
we have 
= i 1 66, 
(Cé) = —A?ó E 
Thus, 
a2 
: - us : K?ó, = 4nee oko vo 
C cop 
from which, finally 
e xk vx 
(y == 4ne 


k?— ( zy 
€ 


From this it follows that the wave with wave vector k has the frequency « = k : v. Similarly, we 
obtain for the vector potential, 


ve^ uk wt 
Ax = ne - 2° 
pe- [EI 
c 
Finally, we have for the fields, 
(k:v) 
—k- v 
k-y ; c 
E, = —ikó, gioco ApS Rel ky gie Hv 
k?— | -— 
C) 
4 kx v 
H; —ikXA, = i i —- - eoi vt, 


§ 65. The Lagrangian to terms of second order 


In ordinary classical mechanics, we can describe a system of particles interacting with 
each other with the aid of a Lagrangian which depends only on the coordinates and velocities 
of these particles (at one and the same time). The possibility of doing this is, in the last 
analysis. dependent on the fact that in mechanics the velocity of propagation of interactions 
is assumed to be infinite. 

We already know that because of the finite velocity of propagation, the field must be 
considered as an independent system with its own "degrees of freedom". From this it follows 
that if we have a system of interacting particles (charges). then to describe it we must consider 
the system consisting of these particles and the field. Therefore, when we take into account 
the finite velocity of propagation of interactions, it is impossible to describe the system of 
interacting particles rigorously with the aid of a Lagrangian, depending only on the co- 
ordinates and velocities of the particles and containing no quantities related to the internal 
“degrees of freedom" of the field. 

However, if the velocity v of all the particles is small compared with the velocity of light, 
then the system can be described by a certain approximate Lagrangian. It turns out to be 
possible to introduce a Lagrangian describing the system, not only when all powers of v/c 
are neglected (classical Lagrangian), but also to terms of second order, v?/c?. This last 
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remark is related to the fact that the radiation of electromagnetic waves by moving charges 
(and consequently, the appearance of a ''self "-field) occurs only in the third approximation 
in v/c (see later, in § 67). 

As a preliminary, we note that in zero’th approximation, that is, when we completely 
neglect the retardation of the potentials, the Lagrangian for a system of charges has the form 

I9 =F im,vi- y 22 (65.1) 
a a>b Ras 

(the summation extends over the charges which make up the system). The second term is the 
potential energy of interaction as it would be for charges at rest. 

To get the next approximation, we proceed in the following fashion. The Lagrangian for a 
charge e, in an external field is 


2 va A 
L,- —mgc 1-3 -66t o Arv (65.2) 


Choosing any one of the charges of the system, we determine the potentials of the field 
produced by all the other charges at the position of the first, and express them in terms of the 
coordinates and velocities of the charges which produce this field (this can be done only 
approximately—for @, to terms of order c?/c?, and for A, to terms in v/c). Substituting the 
expressions for the potentials obtained in this way in (65.2), we get the Lagrangian for one of 
the charges of the system (for a given motion of the other charges). From this, one can then 
easily find the Lagrangian for the whole system. 
We start from the expressions for the retarded potentials 


€r- Ric 1 
= | Re qy, A=- 
? f R c 


If the velocities of all the charges are small compared with the velocity of light, then the 
charge distribution does not change significantly during the time R/c. Therefore we can 


expand Q,- rje and j,- 4, in series of powers of R/c. For the scalar potential we thus find, to 
terms of second order: 


dV 10 1 æ 
$- et -ia feart z f Reav 


h- Ric 
—— dV. 
R 


R c ét 2c? or 
(o without indices is the value of o at time ¢; the time differentiations can clearly be taken 
out from under the integral sign). But f odV is the constant total charge of the system. 
Therefore the second term in our expression is zero, so that 
odV 1# 
= -—-—zlRedV. 65.3 
$ Í R 2an je (65.3) 
We can proceed similarly with A. But the expression for the vector potential in terms of 
the current density already contains 1/c, and when substituted in the Lagrangian is multiplied 
once more by 1/c. Since we are looking for a Lagrangian which is correct only to terms of 
second order, we can limit ourselves to the first term in the expansion of A, that is, 


1 f ov 
-- [ £ ay (65.4) 
(we have substituted j = ov). 
+ For systems consisting of particles with the same charge-to-mass ratio, the appearance of radiation is put off 


to the fifth approximation in v/c; in such a case there is a Lagrangian to terms of fourth order in v/c. [See B. M. 
Barker and R. F. O’Connel, Can. J. Phys. 58, 1659 (1980).] 
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Let us first assume that there is only a single point charge e. Then we obtain from (65.3) 
and (65.4), 

e | e QR ev 
=- LI A=—, 65.5 

R * 2d OF cR (23) 
where R is the distance from the charge. 

We choose in place of $ and A other potentials $' and A’, making the transformation (see 


§ 18): 


$ 


$- E A’ — A + grad f, 
c ôt 
in which we choose for f the function 
EE OR 
~ 2c ét 
Then we gett 
e ev e eR 
pend A’ LU dn —, 
$ R cR 2c dt 


To calculate A’ we note first of all that V(OR/Ot) = (ô /ĉt)V R. The grad operator here 
means differentiation with respect to the coordinates of the field point at which we seek the 
value of A'. Therefore VR is the unit vector n, directed from the charge e to the field point, 
so that 

ev e. 

— cR x ae 


: (5) R RR 
A= = 


TOR eR Re 


A’ 


We also write: 


But the derivative — R for a given field point is the velocity v of the charge, and the derivative 
R is easily determined by differentiating R? = R’, that is, by writing 


RR=R-R= -R-v. 


Thus, 
—v+n(n-v) 
= x . 
Substituting this in the expression for A', we get finally: 
e e[v - (v: n)n] 
taste A =s D, l 
d 2cR (95.9) 


If there are several charges then we must, clearly. sum these expressions over all the charges. 

Substituting these expressions in (65.2), we obtain the Lagrangian L, for the charge e, 
(for a fixed motion of the other charges). In doing this we must also expand the first term in 
(65.2) in powers of v,/c, retaining terms up to the second order. Thus we find: 


2 4 
Mava lm, , €, 2 €p 
L, = "es =e? z t zai n Ya Yot (Vo * May (Vp * Na 
2 8 c? LR. 2c? z Ra l b ( ot b »)] 


(the summation goes over all the charges except e, ; na is the unit vector from e, to e,). 


+ These potentials no longer satisfy the Lorentz condition (62.1), nor the equations (62.3-4). 
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From this, it is no longer difficult to get the Lagrangian for the whole system. It is easy 
to convince oneself that this function is not the sum of the L, for all the charges, but has the 
form 


pay ee yy Se y e SCC. E 
=LI 4&8 SR, 4X m Va Va (Va na) na)J. 


Actually, for each of the charges under a given motion of all the others, this function L 
goes over into L, as given above. The expression (65.7) determines the Lagrangian of a 
system of charges correctly to terms of second order. (1t was first obtained by C. G. Darwin, 
1922.) 

Finally we find the Hamiltonian of a system of charges in this same approximation. This 
could be done by the general rule for calculating ¥ from L; however it is simpler to proceed 
as follows. The second and fourth terms in (65.7) are small corrections to L'?? (65.1). On the 
other hand, we know from mechanics that for small changes of L and X. the additions to 
them are equal in magnitude and opposite in sign (here the variations of L are considered 
for constant coordinates and velocities, while the changes in # refer to constant coordinates 
and momenta).t 

Therefore we can at once write #, subtracting from 

(0) y Pa fae 
= Lom, BR 
the second and fourth terms of (65.7), replacing the velocities in them by the first approxima- 
tion v, = ie i Thus 


€, €, 
"Xx m; Tom Roo 
Y L4 — [p, Po + (ps nap, Mas] (65.8) 
45 2c?m NM Pa ` Po t (Pa ^ Map p» ^ Da; ; 
PROBLEMS 


1. Determine (correctly to terms of second order) the center of inertia of a system of interacting 
particles. 
Solution: The problem is solved most simply by using the formula 


b3 afat [| WrdV 
EDIDI 


[see (14.6)], where 4, is the kinetic energy of the particle (including its rest energy), and W is the 
energy density of the field produced by the particles. Since the &, contain the large quantities 
m, c?, it is sufficient, in obtaining the next approximation, to consider only those terms in &, and 
W which do not contain c, i.e. we need consider only the nonrelativistic kinetic energy of the particles 
and the energy of the electrostatic field. We then have: 


i oe? 1 o? 1 f ] 


T See Mechanics, § 40. 
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the integral over the infinitely distant surface vanishes; the second integral also is transformed 
into a surface integral and vanishes, while we substitute Ag = — zo in the third integral and obtain: 


1 l 
f Weav=3 [ por dv m F eetere 


where ¢, is the potential produced at the point ra by all the charges other than e,.T 
Finally, we get: 


1 Pa? | Caw Eb 
a 2. P9. bs 
R pte (met Fe + 25 Ra, 


(with a summation over all b except b — a), where 


2 
emy (mei f us se) 
a ab 


a a >b 


is the totał energy of the system. Thus in this approximation the coordinates of the center of inertia 
can actually be expressed in terms of quantities referring only to the particles. 

2. Write the Hamiltonian in second approximation for a system of two particles, omitting the 
motion of the system as a whole. 

Solution: We choose a system of reference in which the total momentum of the two particles is 
zero. Expressing the momenta as derivatives of the action, we have 


Pı po = 2S/ér, +- oS/or; = 0. 


From this it is clear that in the reference system chosen the action is a function of r = r;—r;, the 
difference of the radius vectors of the two particles. Therefore we have p; = —p, =p, where 
p = ¢S/cr is the momentum of the relative motion of the particles. The Hamiltonian is 


1/1 1 1 1 1 e 2 eez 
mpm Le E SA 4. 1 T 2 -nY? 
2 (s. a) R 8c? (s : a dio r K 2m, mz cr [p T(p n) l 


t The elimination of the self-field of the particles corresponds to the mass “renormalization” mentioned 
in the footnote on p. 90). 


CHAPTER 9 


RADIATION OF ELECTROMAGNETIC WAVES 


§ 66. The field of a system of charges at large distances 


We consider the field produced by a system of moving charges at distances large compared 
with the dimensions of the system. 

We choose the origin of coordinates O anywhere in the interior of the system of charges. 
The radius vector from O to the point P, where we determine the field, we denote by Ro, 
and the unit vector in this direction by n. Let the radius vector of the charge element 
de = odV be r, and the radius vector from de to the point P be R. Obviously R = R,-r. 

At large distances from the system of charges, Ry > r, and we have approximately, 


R = |R,—rj = Ro—r:n. 


We substitute this in formulas (62.9), (62.10) for the retarded potentials. In the denominator 
of the integrands we can neglect r:n compared with Rọ. In t—(R/c), however, this is 
generally not possible; whether it is possible to neglect these terms is determined not by the 
relative values of Ro/c and r - (n/c), but by how much the quantities o and j change during 
the time r (n/c). Since Rp is constant in the integration and can be taken out from under the 
integral sign, we get for the potentials of the field at large distances from the system of 
charges the expressions: 


1 
$7 gc | actus". (66.1) 
2 66.2 
= cR, E AV. ( . ) 


At sufficiently large distances from the system of charges, the field over small regions of 
space can be considered to be a plane wave. For this it is necessary that the distance be large 
compared not only with the dimensions of the system, but also with the wavelength of the 
electromagnetic waves radiated by the system. We refer to this region of space as the ware 
zone of the radiation. 

In a plane wave, the fields E and H are related to each other by (47.4), E = H xn. Since 
H = curl A, it is sufficient for a complete determination of the field in the wave zone to 
calculate only the vector potential. In a plane wave we have H = (1/c)À xn [see (47.3)], 
where the dot indicates differentiation with respect to time.t Thus, knowing A, we find H 


+ In the present case, this formula is easily verified also by direct computation of the curl of the expression 
(66.2), and dropping terms in 1/R$ in comparison with terms ~ 1/R,. 
171 
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and E from the formulas:t 


l, E- 
H= Axe E= (Axn)xn. (66.3) 


We note that the feld at large distances is inversely proportional to the first power of the 
distance Rọ from the radiating system. We also note that the time t enters into the expressions 
(66.1) to (66.3) always in the combination 1 — (Roc). 

For the radiation produced by a single arbitrarily moving point charge, it turns out to be 
convenient to use the Lienard-Wiechert potentials. At large distances, we can replace the 
radius vector R in formula (63.5) by the constant vector Ro, and in the condition (63.1) 
determining ¢’, we must set R = Ro—r, : n(ro(f) is the radius vector of the charge). Thus,f 


tU 
Pae ev( 2 T (66.4) 
c 
where ¢’ is determined from the equality 
Ü R 
pl. cio e (66.5) 
c c 


The radiated electromagnetic waves carry off energv. The energy flux is given by the 
Poynting vector which, for a plane wave, is 
2 
S=c—n. 
ean 
The intensity d/ of radiation into the element of solid angle do is defined as the amount of 
energy passing in unit time through the element df = R2 do of the spherical surface with 
center at the origin and radius Rọ. This quantity is clearly equal to the energy flux density S 
multiplied by df, i.e. 


H? 
dI = c 7 Rédo. (66.6) 


Since the field H is inversely proportional to Ro, we see that the amount of energy radiated 
by the system in unit time into the element of solid angle do is the same for all distances (if 
the values of t —(Ro/c) are the same for them). This is, of course, as it should be, since the 
energy radiated from the system spreads out with velocity c into the surrounding space, not 
accumulating or disappearing anywhere. 

We derive the formulas for the spectral resolution of the field of the waves radiated by the 
system. These formulas can be obtained directly from those in $ 64. Substituting in (64.2) 
R = Ro—r:n (in which we can set R = Ry in the denominator of the integrand), we get for 
the Fourier components of the vector potential: 


eikRo ; 
A= fira (66.7) 
cRo 


(where k = kn). The components H,, and E, are determined using formula (66.3). Sub- 
stituting in it for H, E, A, respectively, H, e |, E, e^'^', A, e '^', and then dividing by 


T The formula E = —(1/c)A [see (47.3)] is here not applicable to the potentials ¢, A, since they do not 
satisfy the same auxiliary condition as was imposed on them in § 47. 

1 In formula (63.8) for the electric field, the present approximation corresponds to dropping the first 
term in comparison with the second. 
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ei we find 
H,=ikxA,, E= E (kx A,) xk. (66.8) 


When speaking of the spectral distribution of the intensity of radiation, we must dis- 
tinguish between expansions in Fourier series and Fourier integrals. We deal with the expan- 
sion into a Fourier integral in the case of the radiation accompanying the collision of charged 
particles. In this case the quantity of interest is the total amount of energy radiated during the 
time of the collision (and correspondingly lost by the colliding particles). Suppose dé na is the 
energy radiated into the element of solid angle do in the form of waves with frequencies in 
the interval dw. According to the general formula (49.8), the part of the total radiation lying 
in the frequency interval dc/2z is obtained from the usual formula for the intensity by 
replacing the square of the field by the square modulus of its Fourier component and multi- 
plying by 2. Therefore we have in place of (66.6): 


do 
2n 

If the charges carry out a periodic motion, then the radiation field must be expanded in a 
Fourier series. According to the general formula (49.4) the intensities of the various com- 
ponents of the Fourier resolution are obtained from the usual formula for the intensity by 


replacing the field by the Fourier components and then multiplying by two. Thus the intensity 
of the radiation into the element of solid angle do, with frequency œw = noy equals 


ME 2p2 
dE xy = F [Ho] R5 do (66.9) 


= L |W |2R2 
dl, = F |H,|? RÀ do. (66.10) 


Finally, we give the formulas for determining the Fourier components of the radiation 
field directly from the given motion of the radiating charges. For the Fourier integral 
expansion, we have: 


jo = i je’ dt. 


Substituting this in (66.7) and changing from the continuous distribution of currents to a 
point charge moving along a trajectory ro = ro(t) (see § 64), we obtain: 


+o 


Í ev(tye Ut 7k o dt, (66.11) 


— oo 


Since v = dro/dt, vdt = dr, and this formula can also be written in the form of a line 
integral taken along the trajectory of the charge: 


eikRo 


A, = 
"^ cR 


e*Ro (^. 
A, =e cR, | aereas (66.12) 
According to (66.8), the Fourier components of the magnetic field have the form: 
iwe S S cq 
=e OR, fe n x dro. (66.13) 


If the charge carries out a periodic motion in a closed trajectory, then the field must be 
expanded in a Fourier series. The components of the Fourier series expansion are obtained 
by replacing the integration over all times in formulas (66.11) to (66.13) by an average over 
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the period T of the motion (see § 49). For the Fourier component of the magnetic field with 
frequency w = no = n(2n/T). we have 


1 
2nine*Re 


H, = e——;—— gilnwot—k roD] x v(t) dt 
0 


Pa oik Ro 
=e ile $ efron x dro, (66.14) 
[ui 0 


In the second integral. the integration goes over the closed orbit of the particle. 


PROBLEM 


Find the four-dimensional expression for the spectral resolution of the four-momentum radiated 
by a charge moving along a given trajectory. 

Solution: Substituting (66.8) in (66.9), and using the fact that, because of the condition (62.1), 
kġa =k: An, we find: 


c do 
dé, = S (3 Ao]? —|k : A?) R2 do 2: 
ck? 3 ap; 70 _ ck? n pra do 
ES 2n (lA, —| bu ) Ro do 2a yo UT 55 ^: ^5 R; do Fi 
Representing the four-potential A,,, in a form analogous to (66.12), we get: 
ke? 
dE nw me 2 A z'* do dk, 


where 7' denotes the four-vector 
gi = | exp (—ik: x?) dx’ 


and the integration is performed along the world line of the trajectory of the particle. Finally, 
changing to four-dimensional notation [including the four-dimensional "volume element" in 
k-space, as in (10.1a)], we find for the radiated four-momentum: 
2 fet 
dPi— f Sk km)d*k. 


2n?c 


§ 67. Dipole radiation 


The time r-(n/c) in the integrands of the expressions (66.1) and (66.2) for the retarded 
potentials can be neglected in cases where the distribution of charge changes little during 
this time. It is easy to find the conditions for satisfying this requirement. Let T denote the 
order of magnitude of the time during which the distribution of the charges in the system 
changes significantly. The radiation of the system will obviously contain periods of order 
T (i.e. frequencies of order 1/T). We further denote by a the order of magnitude of the 
dimensions of the system. Then the time r - (n/c) ~ a/c. In order that the distribution of the 
charges in the system shall not undergo a significant change during this time, it is necessary 
that a/c « T. But cT. is just the wavelength 4 of the radiation. Thus the condition a < cT 
can be written in the form 

agi, (67.1) 
that is, the dimensions of the system must be small compared with the radiated wavelength. 
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We note that this same condition (67.1) can also be obtained from (66.7). In the integrand, 
r goes through values in an interval of the order of the dimensions of the system, since outside 
the system j is zero. Therefore the exponent ikr is small, and can be neglected for those 
waves in which ka < 1, which is equivalent to (67.1). 

This condition can be written in still another form by noting that T ~ a/v, so that À ~ cajr, 
if v is of the order of magnitude of the velocities of the charges. From a « 2, we then find 


v «c, (67.2) 


that is, the velocities of the charges must be small compared with the velocity of light. 
We shall assume that this condition is fulfilled, and take up the study of the radiation at 
distances from the radiating system large compared with the wavelength (and consequently, 
in any case, large compared with the dimensions of the system). As was pointed out in $ 66, 
at such distances the field can be considered as a plane wave, and therefore in determining 
the field it is sufficient to calculate only the vector potential. 
The vector potential (66.2) of the field now has the form 


| f. 
A= — fiav, (67.3) 
cRo 


where the time ¢’ = t—(R,/c) now no longer depends on the variable of integration. Sub- 
stituting j = ov, we rewrite (67.3) in the form 


1 
A= cR, (> ev) 


(the summation goes over all the charges of the system; for brevity, we omit the index 1’— 
all quantities on the right side of the equation refer to time 7’). But 


Ye-TYe-d 


where d is the dipole moment of the system. Thus, 
1 


A= — d. 67.4 
RR; (67.4) 
With the aid of formula (66.3) we find that the magnetic field is equal to 
1. 
H= cR d xn, (67.5) 
and the electric field to . 
1, 
E= ZR, (dxn)xn. (67.6) 


We note that in the approximation considered here, the radiaticn is determined by the 
second derivative of the dipole moment of the system. Radiation of this kind is called dipole 
radiation. 

Since d = E er, d = X ev. Thus the charges can radiate only if they move with acceleration. 
Charges in uniform motion do not radiate. This also follows directly from the principle of 
relativity, since a charge in uniform motion can be considered in the inertial system in which 
it is at rest, and a charge at rest does not radiate. 
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Substituting (67.5) in (66.6), we get the intensity of the dipole radiation: 
i2 


l2 " d 
dl = xm doces 


; sin? 0 do, (67.7) 


where @ is the angle between d and n. This is the amount of energy radiated by the system 
in unit time into the element of solid angle do. We note that the angular distribution of 
the radiation is given by the factor sin’ 0. 

Substituting do = 2z sin 0 d0 and integrating over 0 from 0 to z, we find for the total 
radiation 


T (67.8) 


If we have just one charge moving in the external field, then d = er and d = ew, where w 
is the acceleration of the charge. Thus the total radiation of the moving charge is 


l=, (67.9) 


We note that a closed system of particles. for all of which the ratio of charge to mass is 
the same, cannot radiate (by dipole radiation). In fact, for such a system, the dipole moment 


d=) er = x- — , mt = = const } mr, 


where const is the charge-to-mass ratio common to all the charges. But X mr = RÈ m, 
where R is the radius vector of the center of inertia of the system (remember that all of the 
velocities are small, v < c, so that non-relativistic mechanics is applicable). Therefore d is 
proportional to the acceleration of the center of inertia, which is zero, since the center of 
inertia moves uniformly. 

Finally, we give the formula for the spectral resolution of the intensity of dipole radiation. 
For radiation accompanying a collision. we introduce the quantity d&,, of energy radiated 
throughout the time of the collision in the form of waves with frequencies in the interval 
dw/2n (sce 8 66). It is obtained by replacing the vector d in (67.8) by its Fourier component 
d,, and multiplying by 2: 


4 . dw 
dé,=— MIL 
ZI NE 


For determining the Fourier components, we have 
> 


Hg Cabs id - NET 
d,e ‘= za due °) = —w*die wt 


from which d,, =- —«?d,. Thus, we get 


4o* , do 


HU = os d 2n 


(67.10) 


w 


For periodic motion of the particles. we obtain in similar fashion the intensity of radiation 
with frequency o = noy in the form 


= “eon 


la,]?. (67.11) 


§ 67 DIPOLE RADIATION 177 


PROBLEMS 


1. Find the radiation from a dipole d, rotating in a plane with constant angular velocity Q.t 
Solution: Choosing the plane of the rotation as the x, y plane, we have: 


d, = dọ cos Qt, d, = d, sin Qr. 


Since these functions are monochromatic, the radiation is also monochromatic, with frequency 

w = Q. From formula (67.7) we find for the angular distribution of the radiation (averaged over 
the period of the rotation): 

= dio 

dl = — =~ 

87c? 


where 0 is the angle between the direction n of the radiation and the z axis. The total radiation is 


(1--cos? 0)do, 


The polarization of the radiation is along the vector d xn — c?n»d. Resolving it into com- 
ponents in the n, z plane and perpendicular to it, we find that the radiation is elliptically polarized, 
and that the ratio of the axes of the ellipse is equal to n, — cos 0; in particular, the radiation 
along the z axis is circularly polarized. 

2. Determine the angular distribution of the radiation from a system of charges, moving as a 
whole (with velocity v), if the distribution of the radiation is known in the reference system in which 
the system is at rest as a whole. 

Solution: Let 

dl’ -=f(cos 0, à) do', do’ = d(cos 0) d¢’ 


be the intensity of the radiation in the K' frame which is attached to the moving charge system 
(8, ¢’ are the polar coordinates; the polar axis is along the direction of motion of the system). The 
energy d4 radiated during a time interval dr in the fixed (laboratory) reference frame K, is related 
to the energy dé’ radiated in the K’ system by the transformation formula 


(the momentum of radiation propagating in a given direction is related to its energy by the equation 
[d P| = dé/c). The polar angles, 0, 0' of the direction of the radiation in the K and K’ frames are 
related by formulas (5.6), and the azimuths ¢ and ¢’ are equal. Finally, the time interval dr’ in the 
K' system corresponds to the time 


dt’ 


in the K system. 
As a result, we find for the intensity d/ = (dé/dr) do in the K system: 


. y? y 
(i- 2) cos 0— —- 
et [4 

V V 


Thus, for a dipole moving along the direction of its own axis, f = const - sin? 0', and by using the 


f The radiation from a rotator or a symmetric top which has a dipole moment is of this type. In the first 
case, d is the total dipole moment of the rotator; in the second case d is the projection of the dipole moment 
of the top on a plane perpendicular to its axis of precession (i.e. the direction of the total angular momentum). 
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y2\3 
(1- = sin? 6 
- do. 


dl = const : (Eee 


V 5 
]— — cos ) 
€ 


formula just obtained, we find: 


§ 68. Dipole radiation during collisions 


In problems of radiation during collisions, one is seldom interested in the radiation 
accompanying the collision of two particles moving along definite trajectories. Usually we 
have to consider the scattering of a whole beam of particles moving parallel to each other, 
and the problem consists in determining the total radiation per unit current density of 
particles. 

If the current density is unity, i.e. if one particle passes per unit time across unit area of 
the cross-section of the beam, then the number of particles in the flux which have “impact 
parameters" between g and g+dgq is 219 dg (the area of the ring bounded by the circles of 
radius 9 and g+dg). Therefore the required total radiation is gotten by multiplying the total 
radiation Aŝ from a single particle (with given impact parameter) by 27g dọ and integrat- 
ing over ¢ from 0 to oo. The quantity determined in this way has the dimensions of energy 
times area. We call it the effective radiation (in analogy to the effective cross-section for 
scattering) and denote it by x:t 


x= | A&-2nodo. (68.1) 
IU 


We can determine in completely analogous manner the effective radiation in a given solid 
angle element do, in a given frequency interval dw, etc.? 

We derive the general formula for the angular distribution of radiation emitted in the 
scattering of a beam of particles by a centrally symmetric field, assuming dipole radiation. 

The intensity of the radiation (at a given time) from each of the particles of the beam under 
consideration is determined by formula (67.7). in which d is the dipole moment of the particle 
relative to the scattering center.$ First of all we average this expression over all directions of 
the vectors d in the plane perpendicular to the beam direction. Since (d x n = d? — (n - dy?^, 
the averaging affects only (n :d)?. Because the scattering field is centrally symmetric and the 
incident beam is parallel, the scattering. and also the radiation, has axial symmetry around 
an axis passing through the center. We choose this axis as x axis. From symmetry, it is 
obvious that the first powers d,, d, give zero on averaging, and since d , is not subjected to 
the averaging process, 


"REP 
The average values of d? and d? are equal to each other, so that 
d; = d; = (4) - d3]. 


t The ratio of z to the energy of the radiating system is called the cross section for energy loss by radiation. 

+ If the expression to be integrated depends on the angle of orientation of the projection of the dipole 
moment of the particle on the planc transverse to the beam, then we must first average over all directions in 
this plane and only then multiply by 2zo do and integrate. 

$ Actually one usually deals with the dipole moment of two particles—the scatterer and the scattered 
particle— relative to their common center of inertia. 
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Keeping all this in mind, we find without difficulty: 
(d x n? = 4(d? + 42) + 4(d? — 3d2) cos? 0, 
where 0 is the angle between the direction n of the radiation and the x axis. 
Integrating the intensity over the time and over all impact parameters, we obtain the 


following final expression giving the effective radiation as a function of the direction of 
radiation: 


do 3 cos? 0—1 
dx, = —3|A4B ————— 
iain E B 5 |. (68.2) 
where 
vc +o ce cto 
2 . 1 . . 
-il Í d? dt2n0 do, B=, [ Í (d? — 3d?) dt 270 do. (68.3) 
0 -o 0 -œ 


The second term in (68.2) is written in such a form that it gives zero when averaged over all 
directions, so that the total effective radiation is x = A/c?. We call attention to the fact that 
the angular distribution of the radiation is symmetric with respect to the plane passing 
through the scattering center and perpendicular to the beam, since the expression (68.2) is 
unchanged if we replace 0 by 1 — 0. This property is specific to dipole radiation, and is no 
longer true for higher approximations in v/c. 

The intensity of the radiation accompanying the scattering can be separated into two 
parts—radiation polarized in the plane passing through the x axis and the direction n 
(we choose this plane as the xy plane), and radiation polarized in the perpendicular plane xz. 

The vector of the electric field has the direction of the vector 


n x (d x n) = n(n:d) d 


[see (67.6)]. The component of this vector in the direction perpendicular to the xv plane is— 
d,, and its projection on the xy plane is |sin 0d, — cos Od, |. This latter quantity is most con- 
veniently determined from the z-component of the magnetic field which has the direction 
dxn. 

Squaring E and averaging over all directions of the vector d in the yz plane, we see first 
of all that the product of the projections of the field on the x» plane and perpendicular to it, 
vanishes. This means that the intensity can actually be represented as the sum of two 
independent parts—the intensities of the radiation polarized in the two mutually per- 
pendicular planes. 

The intensity of the radiation with its electric vector perpendicular to the xy plane is 
determined by the mean square of d? —i(d'— d2). For the corresponding part of the 
effective radiation, we obtain the expression 


o +æ 
1, dol 


=—35 d? — d?)dt2ngdo. 68.4 ) 
Xn mesa | ( x) odo (68.4 ) 
0 -œ 
We note that this part of the radiation is isotropic. It is unnecessary to give the expression 
for the effective radiation with electric vector in the xy plane since it is clear that 
dx! +dx} = dx, 


In a similar way we can get the expression for the angular distribution of the effective 
radiation in a given frequency interval do: 
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3cos?0—1] do do 
dx, o = [A@)+ Blo) | Ine} on (68.5) 
where 
20* f c 
Alo) = — [ azane de,  B(o)- T Í (d2 —342,)2ng do. (68.6) 
0 0 


$ 69. Radiation of low frequency in collisions 


Let us consider the low-frequency "tail" of the spectral distribution of the bremsstrahlung: 
the range of frequencies that is low compared to the frequency wo around which the main 
part of the radiation is concentrated: 


w EW. (69.1) 


We shall not assume that the velocities of the colliding particles are small compared to the 
light velocity, as was done in the preceding paragraph; the following formulas are valid 
for arbitrary velocities. In the nonrelativistic case, «9 — 1/t, where t is the order of magni- 
tude of the duration of the collision; in the ultrarelativistic case, « is proportional to the 
square of the energy of the radiating particle (cf. § 77). 

In the integral 


H, = { He dt, 


the field H of the radiation 1s significantly different from zero only during a time interval of 
the order of 1/«g. Therefore, in accord with condition (69.1), we can assume that œ< | in 
the integral, so that we can replace e/*' by unity; then 

H, = | H dt. 


Substituting H = À x n/c and carrying out the time integration, we get: 
1 
H, = | (A2—A,)xn, (69.2) 


where A, — A, is the change in the vector potential produced by the colliding particles during 
the time of the collision. 

The total radiation (with frequency w) during the time of the collision is found by sub- 
stituting (69.2) in (66.9): 


R2 
dé, = hen [(A; —A,) x n]? do do. (69.3) 


We can use the Lienard-Wiechert expression (66.4) for the vector potential, and obtain: 


dén = [ze] “axm “xn [| do ae (69.4) 


4n*c3 Ul=(jon-v, 1 —(Hic)n:v, 


where v, and v, are the velocities of the particle before and after the collision, and the sum is 
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taken over the two colliding particles. We note that the coefficient of do is independent of 
frequency. In other words, at low frequencies [condition (69.1)], the spectral distribution is 
independent of frequency, i.e. d&,,,/dw tends toward a constant limit as œw — 0.7 

If the velocities of the colliding particles are small compared with the velocity of light, then 
(69.4) becomes 


dE nus -uu (Y e(v; —v,) x nj? dodo. (69.5) 
This expression corresponds to the case of dipole radiation. with the vector potential given 
by formula (67.4). 

An interesting application of these formulas is to the radiation produced in the emission 
of a new charged particle (e.g. the emergence of a f-particle from a nucleus). This process is 
to be treated as an instantaneous change in the velocity of the particle from zero to its 
actual value. [Because of the symmetry of formula (69.5) with respect to interchange of v, 
and vp, the radiation originating in this process is identical with the radiation which would 
be produced in the inverse process—the instantaneous stopping of the particle.] The 
essential point is that, since the "time" for the process is t > 0, condition (69.1) is actually 
satisfied for all frequencies. 


PROBLEM 


Find the spectral distribution of the total radiation produced when a charged particle is emitted 
which moves with velocity v. 


Solution: According to formula (69.4) (in which we set v2 = v, v; = 0), we have: 


dé, = do 


R 
202 sin? @ : 
e O asin 0d0. 
4r? v 5 
ebd 2 cos 0 


Evaluation of the integral gives:$ 


2 i 
dê, =L (Em2 —2) do. (1) 
ne\v cv 
For v «£c, this formula goes over into 
2e?v? 
dé. = ine do, 


which can also be obtained directly from (69.5). 


t By integrating over the impact parameters, we can obtain an analogous result for the effective radiation 
in the scattering of a beam of particles. However it must be remembered that this result is not valid for the 
effective radiation when therc is a Coulomb interaction of the colliding particles. because then the integral 
over ọ is divergent (logarithmically) for large o. We shall see in the next section that in this case the effective 
radiation at low frequencies depends logarithmically on frequency and does not remain constant. 

i However, the applicability of these formulas is limited by the quantum condition that fw be small 
compared with the total kinetic energy of the particle. 

§ Even though, as we have already pointed out, condition (69.1) is satisfied for all frequencies, because the 
process is "instantaneous" we cannot find the total radiated energy by integrating (1) over co—the integral 
diverges at high frequencies. We mention that. aside from the violation of the conditions for classical 
behavior at high frequencies. in the present case the cause of the divergence lies in the incorrect formulation 
of the classical problem, in which the particle has an infinite acceleration at the initial time. 
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§ 70. Radiation in the case of Coulomb interaction 


In this section we present, for reference purposes, a series of formulas relating to the 
dipole radiation of a system of two charged particles; it is assumed that the velocities of the 
particles are small compared with the velocity of light. 

Uniform motion of the system as a whole, i.e. motion of its center of mass, is not of 
interest, since it does not lead to radiation, therefore we need only consider the relative 
motion of the particles. We choose the origin of coordinates at the center of mass. Then the 
dipole moment of the system d = e, ri 4- e; r; has the form 


ga ea (he (70.1) 


m,-c-nms 


where the indices | and 2 refer to the two particles, and r=r,—r, is the radius vector 
between them. and 


mim, 


i m;nm; 
is the reduced mass. 

We start with the radiation accompanying the elliptical motion of two particles attracting 
each other according to the Coulomb law. As we know from mechanicsft, this motion can be 
expressed as the motion of a particle with mass ji in the ellipse whose equation in polar 
coordinates 1s 


be? 
ETE TN eU. (70.2) 
r 


where the semimajor axis a and the eccentricity £ are 


a 2 
a= =, e= I- ———-—. (70.3 
2 é| ux 
Here ¢ is the total energy of the particles (omitting their rest energy!) and is negative for a 


finite motion; M = ur! $ is the angular momentum. and a is the constant in the Coulomb 
law: 


a = |e,e9]. 


The time dependence of the coordinates can be expressed in terms of the parametric 


equations 
Cee 
r= a(l—ecos č), t= — (€—esin č). (70.4) 
a 


One full revolution in the ellipse corresponds to a change of the parameter č from 0 to 27; 
the period of the motion is 
ua? 
EX 

We calculate the Fourier components of the dipole moment. Since the motion is periodic 
we are dealing with an expansion in Fourier series. Since the dipole moment is proportional 


to the radius vector r, the problem reduces to the calculatton of the Fourier components of 
the coordinates x = r cos ¢, y =r sin ġ. The time dependence of x and y is given by the 


1 See Mechanics, § 15. 
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parametric equations 


x-a(cot-c) | y-awvl-e?sin£, 


Wot = é—tsin č. (70.5) 


Here we have introduced the frequency 


—— Qe 
Wo = 2njT = x ujpa? = elep . 
xu 
Instead of the Fourier components of the coordinates, it is more convenient to calculate 
the Fourier components of the velocities, using the fact that X, = — ioo nx,; T, = — ig ny,. 
We have 
T 
x= — lM [em dt. 
—iogn | ognT 
0 
But x dt = dx = —asin £dé; transforming from an integral over ¢ to one over č, we have 
2n 
ia inif-isinf) on ro 
XQ, —.— [ gh 7 tI sin $ de. 
2nn 


` 


0 
Similarly, we find 


2n 3 2n 
p 2 oo: 
ia{ 1—€ "DENM .,. davl—-e "DN ` 
ya = eins esin $) cos č dé = eint esing) de 
2nn 2nnc 
0 0 


(in going from the first to the second integral, we write the integrand as cosé= 
(cos č — I/e) + l/c; then the integral with cos £— 1/e can be done, and gives identically 
zero). Finally, we use a formula of the theory of Bessel functions, 


2n x 
1 PEE 1 20 gg 
— [mac =- Í cos (nč — x sin č)dčě = J,(x), (70.6) 
Qn J n 

0 0 


where J,(x) is the Bessel function of integral order #. As a final result, we obtain the follow- 
ing expression for the required Fourier components: 


ia 1-2 
a ia E 
x, =  J,(ne), Ya = — LLL 
n 


J (ng) (70.7) 
(the prime on the Bessel function means differentiation with respect to its argument). 

The expression for the intensity of the monochromatic components of the radiation is 
obtained by substituting x, and y, into the formula 


4o n* (2 e; V 
I, = ° == X, 24+ NE 
3c? i m, s) (| al EM ) 


[see (67.11)). Expressing a and wọ in terms of the characteristics of the particles, we obtain 


finally: 
6An?4^ / e, — e; M I 
[= (2 — <2) |o) 


KE m, mj 
In particular, we shall give the asymptotic formula for the intensity of very high harmonics 
(large i) for motion in an orbit which is close to a parabola (e close to 1). For this purpose, 


—g? 
m fie). (70.8) 
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Va 2/3 
402 (2) d (3) a=] (70.9) 
p, dA 2 


n » 1, l-e <Í, 


we use the formula 


where ® is the Airy function defined on p. 149.1 
Substituting in (70.8) gives: 


j 64-22/3 ntB gt ( e, € {a ap? iG "Uu J " 
SS ee te unas —€~ ia — —t 
$ ám ca Vn, m, E ;) 


34 2/3 n 2,3 
«(5 p? G a=]} (70.10) 
n 2 


This result can also be expressed in terms of the MacDonald function K,: 


64 né^ fle, eV n n 
L2(1—ey Do 0-2 2, LT £222 23d —-(l—2y? 
qeu) 9x? cha? ( =) ies 99 RSE 1 i 


(the necessary formulas are given in the footnote on p. 202). 
Next, we consider the collision of two attracting charged particles. Their relative motion is 
described as the motion of a particle with mass u in the hyperbola 


1-Fecos d = x (70.11) 
where 
zb A + (70.12) 
(now & > 0). The time dependence of r is given by the parametric equations 
r = a(e cosh €—1), en (e sinh £ — č), (70.13) 


where the parameter č runs through values from — œ to +œ. For the coordinates x, y,we 
have 


x = a(e—cosh č), y- ave? — 1 sinh & (70.14) 


+ For n» 1, the main contributions to the integral 
J,(ne) = 1 cos [n(£ —c sin €)] dé 


RA. 
ij 


come from small values of č (for larger values of č, the integrand oscillates rapidly). In accordance with this, 
we expand the argument of the cosine in powers of č: 


TEE 


J,(ne) = 


because of the rapid convergence of the integral, the upper limit has been replaced by oc; the term in ¿° 
must be kept because the first order term contains the small coefficient 1 —e x (1 ~e?)/2. The integral above 
is reduced to the form (70.9) by an obvious substitution. 
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The calculation of the Fourier components (we are now dealing with expansion in a 
Fourier integral) proceeds in complete analogy to the preceding case. We find the result: 


na 2 
Xy = — HY (ive), — yy = — —-——— HY? (ive). (70.15) 
@ 


where H(!) is the Hankel function of the first kind, of order iv, and we have introduced the 
notation 


w Wa 
v= =— (70.16) 


a pvo 
ua? 


(vo is the relative velocity of the particles at infinity; the energy & = yv2/2).t In the calcula- 
tion we have used the formula from the theory of Bessel functions: 


to 
Í erin ixsinhs që = inHP (ix). (70.17) 


— 00 


Substituting (70.15) in the formula 


4w*p? fe, — e; V? 2 2, dw 
dé. 33 (5-2) LOREM OE 


[see (67.10)], we get: 


242,42 2 2 
dé, = 2 (2 - z) {tenor 4E] [HP do (70.18) 
m m, £? 

A quantity of greater interest is the "effective radiation" during the scattering of a parallel 
beam of particles (see § 68). To calculate it, we multiply dê „ by 210 do and integrate over 
all o from zero to infinity. We transform from an integral over ọ to one over e (between the 
limits 1 and oo) using the fact that 2x9 do = 2na?e de; this relation follows from the definition 
(70.12), in which the angular momentum M and the energy & are related to the impact 
parameter ọ and the velocity vy by 


M = novo, =H- 


The resultant integral can be directly integrated with the aid of the formula 
2 
d , 
z [z+ (5 — 1) zi = dz (2Z,Z,), 


where Z,(z) is an arbitrary solution of the Bessel equation of order p.t Keeping in mind that 
for e-» oo, the Hankel function H{') (ive) goes to zero, we get as our result the following 
formula: 


24,3 2 
dx, = TEL (2 - 2) |HP (iv) HQ" (iv) do. (70.19) 
m, m, 


OA 


t Note that the function H {p (ive) is purely imaginary, while its derivative H ip” (ive) is real. 
t This formula is a direct consequence of the Bessel equation 


zee gre (1-5) z=0. 
zZ zZ 
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Let us consider the limiting cases of low and high frequencies. In the integral 
EE 
ei" 7sinb 2 ge = ig HI (iv) (70.20) 
defining the Hankel function, the only important range of the integration parameter £ is 
that in which the exponent is of order unity. For low frequencies (v < 1), only the region of 
large č is important. But for large č we have sinh č > č. Thus, approximately, 
+a 
i MN 
H!P (iv) = — - [ e "inh? gr HU (iv). 
n 


e 
-g 


Similarly, we find that 
HD (iv) = HY? (iv). 
Using the approximate expression (for small x) from the theory of Bessel functions: 


2.2 
iH (ix) = -= In — 
m yx 
(y = ef, where C is the Euler constant; y = 1.781...). we get the following expression for 
the effective radiation at low frequencies: 


16x? fe e; V, (2nvà 3 
dza = 53 (5 -f£ ET e do for o «^. (70.21) 
3c \m, m, 2 a 
It depends logarithmically on the frequency. 
For high frequencies (v > 1), on the other hand, the region of small £ is important in the 


integral (70.20). In accordance with this, we expand the exponent of the integrand in powers 
of ž and get, approximately, 


i ae ie 2i " ive 
Hi (iv) = -- | e e dž = -TRe(fe 6 at). 
Au 0 


By the substitution /v23/6 = 7, the integral goes over into the T-function, and we obtain the 


result : 
i 61? l 
HP (iy) = -— | Tr{-). 
vine A) TG) 


1 f6N 42 
Hiv) = — C) r (5). 
TA/3ÀXY 3 


Next, using the formula of the theory of the T-function, 


Similarly, we find 


roo -x)= 


sin mx’ 


we obtain for the effective radiation at high frequencies: 


16 2 2 3 
Hii e e " A) dw, for o» EZ, (70.22) 
a 


e 33o My, m, 


that is, an expression which is independent of the frequency. 
We now proceed to the radiation accompanying the collision of two particles repelling 
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each other according to the Coulomb law U = a/r(x > 0). The motion occurs in a hyper- 
bola, 
a(e? — 1) 


r 


-14£cosó = (70.23) 


X = a(e+cosh £), y = avc! —1 sinh ë, 


3 
t= = (e sinh ë+ č) (70.24) 


[a and c as in (70.12)]. All the calculations for this case reduce immediately to those given 
above, so it is not necessary to present them. Namely, the integral 
+m 
x,m E | eem? sinh Ede 
w 

—00 
. for the Fourier component of the coordinate x reduces, by making the substitution 
£ — in — £, to the integral for the case of attraction, multiplied by —e7""; the same holds 
for y. 

Thus the expressions for the Fourier components x,,. 1,, in the case of repulsion differ 
from the corresponding expressions for the case of attraction by the factor e^ "". So the only 
change in the formulas for the radiation is an acditional factor e^ ?"*. In particular, for low 
frequencies we get the previous formula (70.21) (since for v <1, e^?" = 1). For high 
frequencies, the effective radiation has the form 


lóna? fe, | eM 2102 poè 
dxo = Pays (= — i) exp (- m. ) da, for «c > PE (70.25) 


It drops exponentially with increasing frequency. 


PROBLEMS 


1. Calculate the average total intensity of the radiation for elliptical motion of two attracting 
charges. 

Solution: From the expression (70.1) for the dipole moment, we have for the total intensity of the 
radiation: 


where we have used the equation of motion uf = — ar/r?. We express the coordinate r in terms of ¢ 
from the orbit equation (70.2) and, by using the equation dt = ur?d$/M, we replace the time in- 
tegration by an integration over the angle ¢ (from 0 to 2z). As a result, we find for the average 


intensity: 
T 
. ] 23 €i €2 2 512 aj & 9/2 2141 M? 
- -—[—-—-— 3— — . 
I T ll Idt 38 (s ma M? "2 
0 


2. Calculate the total radiation A4 for the collision of two charged particles. 

Solution: In the case of attraction the trajectory is the hyperbola (70.11) and in the case of 
repulsion, (70.23). The angle between the asymptotes of the hyperbola and its axis is øo, determined 
from +cos $o = 1/z, and the angle of deflection of the particles (in the system of coordinates in 
which the center of mass is at rest) is x = |z —2¢o|. The calculation proceeds the same as in Problem 
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1 (the integral over ¢ is taken between the limits — o and -: ġo). The result for the case of attraction 


1S 
re Z . "AW zl fea ev 
AC IL tu T5! E )]:3tan?— 6 tan => [— ——], 
a ae a( 5) i 2 (2 =) 


and for the case of repulsion: 


D Z X zi fer cV 
A6 = —— tan? 24(zx- z)[ 1 +3 tan? —6tan — ——J. 
TI ADOS: f e ( 2 2f (m ms 


In both, 7 is understood to be a positive angle, determined from the relation 


Thus for a head-on collision (o — 0, 7 —» x) of charges repelling each other: 


quis 8,8 v5 fic s T 
45a \m m 
3. Calculate the total effective radiation in the scattering of a beam of particles in a repulsive 


Coulomb field. 
Solution: The required quantity is 


arta 2 c 40 
x= Ididnsda m S [5 \ oy [ - pais 
3c3 \m m ri 
o o ò -0 


We replace the time integration by integration over r along the trajectory of the charge. writing 
dt = dr/v,, where the radial velocity v, & r is expressed in terms of r by the formula 


2 1? 2,.2 
t= TGE -vo)] = q- ELTE 
Hu ur? r ur 


The integration over r goes between the limits from oo to the distance of closest approach ro = ro(e) 
(the point at which vr, = 0), and then from ro once again to infinity; this reduces to twice the integral 
from ro to co. The calculation of the double integral is conveniently done by changing the order of 
integration—integrating first over » and then over r. The result of the calculation is: 


4. Calculate the angular distribution of the total radiation emitted when one charge passes by 
another, if the velocitv is so large (though still small compared with the velocity of light) that the 
deviation from straight-line motion can be considered small. 

Solution: The angle of deflection is small if the kinetic energy 4?/2 is large compared to the 
potential energy, which is of order «/o(ut? > a/e). We choose the plane of the motion as the x, y 
plane, with the origin at the center of inertia and the x axis along the direction of the velocity. In 
first approximation, the trajectory is the straight line x = cr, y = o. In the next approximation, the 
equations of motion give 

S 8X, stt is 
HY T ay RE W= ELSE 


B 
B 8 


with 


pm Vx? y? x V goue, 


Using formula (67.7), we have: 


2 2 
dé, = do H : (2 É s) [ep — (n, — jn,] dr, 


nm; mz 


Tah 
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where n is the unit vector in the direction of do. Expressing the integrand in terms of ¢ and per- 
forming the integration, we get: 


a? € ez 
dé, = = — ? (4 — n2 — 3n?) do. 
"o 32eg? (= ms E ») 


§ 71. Quadrupole and magnetic dipole radiation 


We now consider the radiation associated with the succeeding terms in the expansion 
of the vector potential in powers of the ratio a@// of the dimensions of the system to the wave 
length. Since a// is assumed to be small, these terms are generally small compared with the 
first (dipole) term, but they are important in those cases where the dipole moment of the 
system is zero, so that dipole radiation does not occur. 

Expanding the integrand in (66.2), 


1 
A=—- |j sdy, 
cR, [i c 
in powers of r : n/c, we find, correct to terms of first order: 
i 1 @ 
A = — | jedV — nj, dV. 

Khi + OR, sfe n)j, 

Substituting j = ov and changing to point charges, we obtain: 


Ye | ô 


= -- +-—-— Y ev(r-n). (71.1) 
o € 


(From now on, as in $ 67, we drop the index 7’ in all quantities). 
In the second term we write 


^ 


vem) err; 


; vn: 0) 5 Hay) 


-12 r(n: p 5 rx)xn. 


268 Ot 
We then find for A the expression 
A= A. tom a Y er(n-r)+ alo x n), (71.2) 


where d is the dipole moment of the system, and 
1 
m= > Y erxv 


is its magnetic moment. For further transformation, we note that we can, without changing 
the field, add to A any vector proportional to n, since according to formula (66.3), H and E 
are unchanged by this. For this reason we can replace (71.2) by 


à 1 1 
= 3r(n-r)—nr? | - — iii x n. 
A + 6c? Ro ipd n-alt R 
But the expression under the summation sign is just the product n; D,, of the vector n and 


the quadrupole moment tensor Dag = È e(3x, X5 — 0,57?) (see § 41). We introduce the vector 
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D with components D, = D,,n;, and get the final expression for the vector potential: 


d Los 
A = Å= D + — ï . 71.3 
cRo a 6c? Ro cRo MR CES 
Knowing A, we can now determine the fields H and E of the radiation, using the general 
formula (66.3): 


1 : l X 
H = ag dene (Doce (ixn at, 


E= 1 (dxmxns (Dx) n emn. (71.4) 
cR 6c 


0 

The intensity dI of the radiation in the solid angle do is given by the general formula 
(66.6). We calculate here the total radiation, i.e., the energy radiated by the system in unit 
time in all directions. To do this, we average dI over all directions of n; the total radiation 
is equal to this average multiplied by 4z. In averaging the square of the magnetic field, all the 
cross-products of the three terms in H vanish, so that there remain only the mean squares of 
the three. A simple calculation? gives the following result for Z: 

1 . 2 
PNE Di-c = me (71.5) 
Thus the total radiation consists of three independent parts; they are called, respectively, 
dipole, quadrupole, and magnetic dipole radiation. 

We note that the magnetic dipole radiation is actually not present for many systems. 
Thus it is not present for a system in which the charge-to-mass ratio is the same for all the 
moving charges (in this case the dipole radiation also vanishes, as already shown in § 67). 
Namely, for such a system the magnetic moment is proportional to the angular momentum 
(see § 44) and therefore, since the latter is conserved, m = 0. For the same reason, magnetic 
dipole radiation does not occur for a system consisting of just two particles (cf. the problem 
in § 44. In this case we cannot draw any conclusion concerning the dipole radiation). 


2 
12 
303 i 


PROBLEMS 


1. Calculate the total effective radiation in the scattering of a beam of charged particles by 
particles identical with them. 

Solution: In the collision of identical particles, dipole radiation (and also magnetic dipole 
radiation) does not occur, so that we must calculate the quadrupole radiation. The quadrupole 
moment tensor of a system of two identical particles (relative to their center of mass) is 


e 5 
Dag = 2 (3xoxg —r7ó45), 
where x, are the components of the radius vector r between the particles. After threefold differentia- 


+ We present a convenient method for averaging the products of components of a unit vector. Since the 
tensor Ma n5 is symmetric, it can be expressed in terms of the unit tensor «s. Also noting that its trace is 1, 
we have: 


Wing = Sas. 
The average value of the product of four components is: 
Ne Bg Ny Ns = 15(6eg 6,5 ^ 05,055 ~ uo Ôgy). 


The right side is constructed from unit tensors to give a fourth-rank tensor that is symmetric in all its indices; 
the overall coefficient is determined by contracting on two pairs of indices, which must give unity. 
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tion of Dap, we express the first, second, and third derivatives with respect to time of x, in terms 
of the relative velocity of the particles v, as: 


: a at — XU, 
X, e* ——————- 


Xa 77 Uas UN = Aa = gS ri , 


2 r 
where v, =v-r/r is the radial component of the velocity (the second equality is the equation of 
motion of the charge, and the third is obtained by differentiating the second). The calculation leads 
to the following expression for the intensity: 


D 2e$ 1I 
P= — 88 2 (vul 2 
1800?  15m?c5 r* iu 


(vt? =r? +03): v and v, are expressible in terms of r by using the equalities 


m. exa m.. 
E 


We replace the time integration by an integration over r in the same way as was done in Problem 3 
of § 70, namely, we write 


In the double integral (over » and r), we first carry out the integration over o and then over r. The 
result of the calculation is: 
ELLA 


9 mc? 

2. Find the force reacting on a radiating system of particles which is carrying out a stationary 
finite motion. 

Solution: The required force F is obtained by calculating the loss of momentum of the system 
per unit time, i.e. it is the momentum flux carried off by the electromagnetic waves radiated by the 
system: 


F, 1935 dfg — ~ jog,ngR; do; 


the integration is over a large sphere of radius Re. The stress tensor is given by formula (33.3) 
and the fields E and H by (71.4). In view of the transversality of these fields, the integral reduces to 


l 
F=- RELAIS do. 


The average over the direction of n is done using the formulas in the fotnoote on p. 189 (where 
the product of an odd number of components of n gives zero). The result is:t 


l loop 2. 
F, = — — \— d+ z (d xm),}. 
A gacr | ise Perón 3 d $i) 


§ 72. The field of the radiation at near distances 


The formulas for the dipole radiation were derived by us for the field at distances large 
compared with the wavelength (and, all the more, large compared with the dimensions of the 
radiating system). In this section we shall assume, as before, that the wavelength is large 
compared with the dimensions of the system, but shall consider the field at distances which 
are not large compared with, but of the same order as, the wavelength. 


t1 We note that this force is of higher order in 1/c than the Lorentz frictional forces (3 75). The latter give 
no contribution to the total force of recoil: the sum of the forces (75.5) acting on the particles of an electri- 
cally neutral system is zero. 
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The formula (67.4) for the vector potential 


A- Ld 72.1 
~ Ro (72.1), 

is still valid, since in deriving it we used only the fact that Rọ was large compared with the 
dimensions of the system. However, now the field cannot be considered to be a plane wave 
even over small regions. Therefore the formulas (67.5) and (67.6) for the electric and mag- 
netic fields are no longer applicable, so that to calculate them, we must first determine both 
A and 6$. 

The formula for the scalar potential can be derived directly from that for the vector 
potential, using the general condition (62.1), 


1 og 
div A+ - —= 0, 
y c ét 
imposed on the potentials. Substituting (72.1) in this, and integrating over the time, we get 
d 
= —div--. 72.2 
$= -div y (72.2) 


The integration constant (an arbitrary function of the coordinates) is omitted, since we are 
interested only in the variable part of the potential. We recall that in the formula (72.2) as 
well as in (72.1) the value of d must be taken at the time /' = f—(Ro/c).t 

Now it is no longer difficult to calculate the electric and magnetic field. From the usual 
formulas, relating E and H to the potentials, 


] d 
H =-curl _ , (72.3) 
c Ro 
d id 
E = grad div —— ,—. (72.4) 
Ro cR, 


The expression for E can be rewritten in another form, noting that d,/Rọ | just as any 


, . . 1, R . . 
function of coordinates and time of thc form R. Í 229] satisfies the wave equation: 
o c 


(=el) 


curl curl a = grad div a — Aa, 


a? 
{ c 
2 


c? ôt 


Also using the formula 


we find that 


d 
E = curl curl- . (72.5) 
Ro 
The results obtained determine the ficld at distances of the order of the wavelength. It is 


+ Sometimes one introduces the so-called Hertz vector, defined by 


Then 
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understood that in all these formulas it is not permissible to take 1/Ry out from under the 
differentiation sign, since the ratio of terms containing !/Rj to terms with [/Ry is just of the 
same order as 4/Ro. 

Finally, we give the formulas for the Fourier components of the field. To determine H,, 
we substitute in (72.3) for H and d their monochromatic components H, e" and d, e^". 
respectively. However. we must remember that the quantities on the right sides of cquations 
(72.1) to (72.5) refer to the time f’ = t—(Ro/c). Therefore we must substitute in place of d 
the expression 

-iws _ Ro 
de oar ) = dew itt Ro, 


Making the substitution and dividing by e^'**, we get 


eikRo LO 
H, = — ik curl (a. = ikd, x V . 
Ro Ro 
or, performing the differentiation, 
İK l 
H, = ikd, xn e — x) pP (72.6) 
0 0 
where n is a unit vector along Ry. 
In similar fashion, we find from (72.4): 


ikRo ik Ro 


= kd, — Ze tae WWE 


Ro 
and differentiation gives 
k? ik 1\ us k? 3ik 3\ ar 
= — | efo — + 4 e*5o, 72.7 
Ve RTT)" -— R RP xU 02n 


At distances large compared to the wave length (ARy > 1), we can neglect the terms in 
1/R? and I/Rj in formulas (72.7) and (72.6), and we arrive at the field in the “wave zone", 
2 E 
o 5 p P X Cu x es, H, = — p fe xne 
At distances which are small compared to the wave length (ARo < 1), we neglect the terms in 
1/R, and 1/R? and set e*^ = 1; then 


ikRo 


E 


E f3n(d,, n)— da}, 


w = i t 
which corresponds to the static field of an electric dipole (§ 40); in this approximation, the 
magnetic field vanishes. 


PROBLEMS 


Caiculate the quadrupole and magnetic dipole radiation fields at near distances. 
Solution: Assuming, for brevity, that dipole radiation is not present, we have (see the calculation 


carried out in § 71) 
1 dV 
=- j — x — V 
A JE je» j 


where we have expanded in powers of r = Ro—-R. In contrast to what was done in § 71, the factor 
1/ R, cannot here be taken out from under the differentiation sign. We take the differential operator 


Fy 
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out of the integral and rewrite the integral in tensor notation: 


(X; are the components of the radius vector Ro). Transforming from the integral to a sum over the 
charges, we find 


In the same way as in § 71, this expression breaks up into a quadrupole part and a magnetic dipole 
part. The corresponding scalar potentials are calculated from the vector potentials in the same way 
as in the text. As a result, we obtain for the quadrupole radiation: 


and for the magnetic dipole radiation: 


A=curl—, ¢=0 
= curl —, = 
Ro 
[all quantities on the right sides of the equations refer as usual to the time 1’ = t—(Ro/c)]. 
The field intensities for magnetic dipole radiation are: 


1 m m 
E = — - curl —, H = curi curl —. 
€ Ro Ro 
Comparing with (72.3), (72.5), we see that in the magnetic dipole case, E and H are expressed in 
terms of m in the same way as—H and E are expressed in terms of d for the electric dipole case. 
The spectral components of the potentials of the quadrupole radiation are: 
; ^ ikR 2 IER 
AQ» = ik D® Es 9g eL go zm 1 Do 2 i d 
* 6 "9 0X4 Ro’ 6 " 8X, 2X, Ro 
Because of their complexity, we shall not give the expressions for the field. 

2. Find the rate of loss of angular momentum of a system of charges through dipole radiation 
of electromagnetic waves. 

Solution: According to (32.9) the density of flux of angular momentum of the electromagnetic 
field is given by the spatial components of the four-tensor x'7^' -x*7"'. Changing to three-dimen- 
sional notation, we introduce the three-dimensional angular momentum vector with components 
heap M”: the flux density is given by the three-dimensional tensor 


3659, ÁX pT 7,04) = CapX pos 


where 6,5 =T* is the three-dimensional Maxwell stress tensor (and we write all indices as subscripts 
in accordance with the usual three-dimensional notation). The total angular momentum lost by 


the system per unit time is equal to the flux of angular momentum of the radiation field through 
a spherical surface of radius Ro: 


dM 
E a i d Cap 8? 55 df, 


where df = R} do, and n is a unit vector in the direction of Ro. Using the tensor 7,5 from (33.3), 
we find: 

dM R 

pee RE § 2 eF} E . i 

di E f(n -E)n- E) —(n - HXn: H); do. (1) 

In applying this formula to the radiation field at large distances from the system. we must not, 

however, stop at terms —1 Ro: in this approximation n-E =n:H = 0, so that the integrand 

vanishes. These terms [given by (67.5-6)] are sufficient only for calculating the factors n « E and 

n- H; the longitudinal field components n: E and n- H arise from terms —1 R (as a result, the 

integrand in (1) becomes ~! R}, and the distance Ro drops out of the answer, as it should). In 
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the dipole approximation ż =a, and we must distinguish between terms containing additional 
factors [relative to (67.5-6)] — Ro or ~a Ro: it is sufficient to keep only the former. These terms 
can be obtained from (72.3) and (72.5); a calculation to second order in | Ro gives:t 


E:n nd, H:n -0 2 
cR: " e» 
Substituting (2) and /^7.6) in (1), we find: 
dM . . 
TT Tae | (n | d)(n-d) do. 


Finally, writing the integrand in the form e,g,t1gd,215d¢5 and averaging over the direction of n, we 
obtain: 
dM 2 


dM PEN 3 
dt DES d. e 


We note that for a linear oscillator (d — do cos œt with real amplitude do) the expression (3) 
vanishes; there is no loss of angular momentum in the radiation. 


§ 73. Radiation from a rapidly moving charge 


Now we consider a charged particle moving with a velocity which is not small compared 
with the velocity of light. 

The formulas of § 67, derived under the assumption that v < c, are not immediately 
applicable to this case. We can, however, consider the particle in that system of reference in 
which the particle is at rest at a given moment; in this system of reference the formulas 
referred to are of course valid (we call attention to the fact that this can be done only for the 
case of a single moving particle; for a system of several particles there is generally no system 
of reference in which all the particles are at rest simultaneously). 

Thus in this particular system of reference the particle radiates, in time dr, the energy 

2 


2 
dé = x w? dt (73.1) 


[in accordance with formula (67.9], where w is the acceleration of the particle in this system 
of reference. In this system of reference, the total radiated momentum is zero: 
dP — 0. (73.2) 
In fact, the radiated momentum is given by the integral of the momentum flux density in the 
radiation field over a closed surface surrounding the particle. But because of the symmetry 
of the dipole radiation, the momenta carried off in opposite directions are equal in magnitude 
and opposite in direction; therefore the integral is identically zero. 
For the transformation to an arbitrary reference system, we rewrite formulas (73.1) and 
(73.2) in four-dimensional form. It is easy to see that the “radiated four-momentum” dP; 


must be written as 


i 2e? du" du, , , 2e? du du, , 
dP' = — 3e ds ds dx! = — 3c "ds "ds u'ds. (73.3) 


In fact, in the reference frame in which the particle is at rest, the space components of the 


t A nonzero value of n: H would be obtained only if one included terms of higher order in a Ro. 
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four-velocity wu! are equal to zero, and 


therefore the space components of dP' become zero and the time component gives equation 
(73.1). 
The total four-momentum radiated during the time of passage of the particle through a 
given electromagnetic field is equal to the integral of (73.3), that is, 
2e? f du” du, , , 
APi = — — | —- — dxi. 73.4 
3c ds ds 034) 
We rewrite this formula in another form, expressing the four-acceleration du'/ds in terms of 
the electromagnetic field tensor, using the equation of motion (23.4): 


du, e Fou! 
mc— = ` Fu. 
ds c " 
We then obtain 


i 2e* ig pkm i 
AP’ = — 3nle (Fuu (Fou, dx’. (73.5) 


The time component of (73.4) or (73.5) gives the total radiated energy A&. Substituting 
for all the four-dimensional quantities their expressions in terms of three-dimensional 
quantities, we find 


2e? 2 
Ag = Í st (73.6) 


(w = $ is the acceleration of the particle), or, in terms of the external electric and magnetic 
fields: 


AE =a Í M; d. (73.7) 
Žo 1-7; 
The expressions for the total radiated momentum differ by having an extra factor v in the 
integrand. 

It is clear from formula (73.7) that for velocities close to the velocity of light, the total 
energy radiated per unit time varies with the velocity essentially like [1 — (v?/c?)]  ! , that is, 
proportionally to the square of the energy of the moving particle. The only exception is 
motion in an electric field, along the direction of the field. In this case the factor [1 —(v?/c?)] 
standing in the denominator is cancelled by an identical factor in the numerator, and the 
radiation does not depend on the energy of the particle. 

Finally there is the question of the angular distribution of the radiation from a rapidly 
moving charge. To solve this problem, it is convenient to use the Lienard-Wiechert expres- 
sions for the fields, (63.8) and (63.9). At large distances we must retain only the term of 
lowest order in 1/R [the second term in (63.8)]. Introducing the unit vector n in the direction 
of the radiation (R = nR), we get the formulas 
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H-nxE, (73.8) 


where all the quantities on the right sides of the equations refer to the retarded time 
t' 2 t—(R/c). 
The intensity radiated into the solid angle do is dI = (c/4n)E? R? do. Expanding E?, we get 


v? 
2 | : . ? (-5)ew 
e j2mw(vw), w gu OE NN UR (73.9) 


"IU PST eu 


If we want to determine the angular distribution of the total radiation throughout the 
whole motion of the particle, we must integrate the intensity over the time. In doing this, it is 
important to remember that the integrand is a function of t’; therefore we must write 


dl = 


et . 

die ar (1-27) dt’ (73.10) 
c 

[see (63.6)], after which the integration over ¢’ is immediately done. Thus we have the 

following expression for the total radiation into the solid angle do: 


( E Uy ( uu dr. (73.11) 
c| 1- — po TR 
[4 c B 


As we see from (73.9), in the general case the angular distribution of the radiation is 
quite complicated. In the ultrarelativistic case, (1—(v/c) « 1) it has a characteristic 
appearance, which is related to the presence of high powers of the difference 1 —(v- n/c) in 
the denominators of the various terms in this expression. Thus, the intensity is large within 
the narrow range of angles in which the difference | — (v : n/c) is small. Denoting by 0 the 
small angle between n and v, we have: 


v? : 
£g rond. aet. iem. 


v v 0? |] v? 5 
PSU a tet aS oy hea 0); 
or c 2 a c ) 


this difference is small for 


2 
9s fte 5 (73.12) 
Cc 


Thus an ultrarelativistic particle radiates mainly along the direction of its own motion, 
within the small range (73.12) of angles around the direction of its velocity. 

We also point out that, for arbitrary velocity and acceleration of the particle, there are 
always two directions for which the radiated intensity is zero. These are the directions tor 
which the vector n—(v/c) is parallel to the vector w, so that the field (73.8) becomes zero. 
(See also problem 2 of this section.) 

Finally, we give the simpler formulas to which (73.9) reduces in two special cases. 
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If the velocity and acceleration of the particle are parallel, 


e wxn 
H a yT 
eR (i -**) 
c 
and the intensity is 
2 2 ata? 0 
dI = M — do. (73.13) 


3 6 
anc (i — cos 0) 
c 
It is naturally, symmetric around the common direction of v and w, and vanishes along 
(0 = 0) and opposite to (0 = z) the direction of the velocity. In the ultrarelativistic case, 
the intensity as a function of @ has a sharp double maximum in the region (73.12), with a 
steep drop to zero for 0 = Q. 
If the velocity and acceleration are perpendicular to one another, we have from (73.9): 


v? 
2.3 i (1-3) sin? 0 cos? ¢ 


ew 


Anc? 4^7. 6 
me (1-2 cos e) (1-2 cos o) 
c c 


where @ is again the angle between n and v, and ó is the azimuthal angle of the vector n 
relative to the plane passing through v and w. This intensity is symmetric only with respect 
to the plane of v and w, and vanishes along the two directions in this plane which form the 
angle 0 = cos”! (v/c) with the velocity. 


dl = do, (73.14) 


PROBLEMS 


1. Find the total radiation from a relativistic particle with charge e,, which passes with impact 
parameter o through the Coulomb field of a fixed center (with potential ¢ = es/r). 

Solution: In passing through the field, the relativistic particle is hardly deflected at all.t We may 
therefore regard the velocity v in (73.7) as constant, so that the field at the position of the particle is 


ear ezr 
jj GE aa anaa 
r? (0? per?) 


L = 
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+ For vc, deviations through sizable angles can occur only for impact parameters 0 ~ e?/mc?, which 
cannot in general be treated classically. 


§ 74 SYNCHROTRON RADIATION (MAGNETIC BREMSSTRAHLUNG) 199 


with x = vt, y = 0. Performing the time integration in (73.7), we obtain: 
neie  4c?—i? 
Imey c2—v?’ 
2. Find the directions along which the intensity of the radiation from a moving particle vanishes. 


Solution: From the geometrical construction (Fig. 15) we find that the required directions n lie 
in the plane passing through v and w, and form an angle 7 with the direction of w where 


Ac = 


v. 
sin y =-~ sin a, 
c 
and « is the angle between v and w. 


3. Find the intensity of the radiation from a particle which is carrying out a stationary motion 
in the field of a circularly polarized piane electromagnetic wave. 

Solution: According to the results of problem 3 of : 48, the particle moves in a circle. and its 
velocity at each moment is parallel to H and perpendicular to E. Its kinetic energy is 


mc? ea, 
== ¢\ pnt? = cy 


Y ] 026? 
(where we use the notation of the problem cited). From formula (73.7) we tind the intensity of the 
radiation: 
EN NE NE 
3n? ])-)o0 — 3o mee ; 


4. The same problem in the field of a linearly polarized wave. 

Solution: According to the results of problem 2 of : 48, the motion occurs in the plane xy, 
passing through the direction of propagation of the wave (the x axis) and the direction of E (the 
y axis); the field H is along the = direction (and H, = E,). From (73.7) we find: 

O WWE n er 
Ommel re’ 


The average over the period of the motion, which was done using a parametric representation in 


the problem cited, gives the result 
Ta eE [ 23 ( E 
BKITIE 8 new ` 


§ 74. Synchrotron radiation (magnetic bremsstrahlung) 


We consider the radiation from a charge moving with arbitrary velocity in a circle in a 
uniform constant magnetic field; such radiation is called magnetic bremsstrahlung. The 
radius of the orbit r and the cyclic frequency of the motion w,, are expressible in terms of the 
field intensity H and the velocity of the particle v, by the formulas (see § 21): 


C ce c A (74.1) 


The total intensity of the radiation over all directions is given directly by (73.7), omitting 
the time integration, in which we must set E = 0 and H Lv: 


jante ud (74.2) 
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We see that the total intensity is proportional to the square of the momentum of the particle. 
If we are interested in the angular distribution of the radiation, then we must use formula 
(73.11). One quantity of interest is the average intensity during a period of the motion. For 
this we integrate (73.11) over the time of revolution of the particle in the circle and divide 
the result by the period T = 2z/w,,. 
We choose the plane of the orbit as the X Y plane (the origin is at the center of the circle), 
and we draw the YZ plane to pass through the direction n of the radiation (Fig. 16). The 


É 
| 
| 
| 


f< 


Fic. 16. 


magnetic field is along the negative Z axis (the direction of motion of the particle in Fig. 16 
corresponds to a positive charge e). Further, let 0 be the angle between the direction k of the 
radiation and the Y axis, and $ = w,,t be the angle between the radius vector of the particle 
and the X axis. Then the cosine of the angle between k and the velocity v is cos 0 cos ó 
(the vector v lies in the XY plane, and at each moment is perpendicular to the radius vector 
of the particle). We express the acceleration w of the particle in terms of the field H and the 
velocity v by means of the equation of motion [see (21.1)]: 


e v? 


wcz—,.f[l-4vxH. 
c? 


After a simple calculation, we get: 


2 2 
u EP y 1025 sin? 0+ (2 - cos cos 6) 
d= doggie (1) [| re 
n 
same 0 (1- ? cos cos e) 


(the time integration has been converted into integration over $ = wyt). The integration is 
elementary, though rather lengthy. As a result one finds the following formula: 


v? v? v? v? 
e*H?v? (1 - 5) 2+ — cos? 0 (i - 30 —4 cos? 0) cos? 6 
c c^ c c 
dI = do - A — ——— 
8zm?c^ E 572 2 EVE 
mre (i-5 cos? 0) «(1-5 cos? 0) 
c c 
(74.4) 
The ratio of the intensity of radiation for 0 = 2/2 (perpendicular to the plane of the orbit) 


ó (74.3) 
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to the intensity for 0 = 0 (in the plane of the orbit) is 


2 
[^ 
ato = - YA. (74.5) 


dl p?\5/? 
aan 8Íí1— — 
(o), ( =) 


As v0, this ratio approaches 4, but for velocities close to the velocity of light, it becomes 
very large. 

Next we consider the spectral distribution of the radiation. Since the motion of the charge 
is periodic, we are dealing with expansion in a Fourier series. The calculation starts con- 
veniently with the vector potential. For the Fourier components of the vector potential we 
have the formula [see (66.12)}: 

A _ elkRo Konnt-k: 0g 
(Tz RT $ e r, 
where the integration is taken along the trajectory of the particle (the circle). For the co- 
ordinates of the particle we have x = r cos wyt, y =r sin wyt. As integration variable we 
choose the angle $ = wyt. Noting that 


k-r = kr cos 0 sin @ = (nv[c) cos 0 sin $ 
(k = nw,,/c = nvjcr), we find for the Fourier components of the x-component of the vector 
potential: 


ev ; . v . 
ha =~ ge eto f etn tonm) in bd 
m= eg oe | nl $ dà 
We have already had to deal with such an integral in 8 70. It can be expressed in terms of the 
derivative of a Bessel function: 


iev ar, f RU 
xn — cR, elkRo J’ (= cos 0), (74.6) 
Similarly, one calculates A,,: 
e . nv 
Ayn = ikRo J (= 0 74.7 
Mi Ra cos 0* "e t) (74.7) 


The component along the Z axis obviously vanishes. 
From the formulas of § 66 we have for thc intensity of radiation with frequency w = nwy, 
in the element of solid angle do: 


€ 3 [4 2 2 
dl, = on |H,|? Ri do = jd x A,|? Ry do. 


Noting that 
A x k|? = AK? + AZK sin? 0, 


and substituting (74.6) and (74.7), we get for the intensity of radiation thc following formula 
(G. A. Schott, 1912): 


245 H7 T > >f nt 2 [mno 
dl, = < ( - SIL 0-52 ( — cos 0) + Je (* cos 0)| do. (74.8) 
C C t 
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To determine the total intensity over all directions of the radiation with frequency w = no, 
this expression must be integrated over all angles. However, the integration cannot be carried 
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out in finite form. By a series of transformations, making use of certain relations from the 
theory of Bessel functions, the required integral can be written in the following form: 
2 


v 
2e*H? | 1-5 e 
cj [n , /2nv 5 v? onéydé 749 
1, = — ay ep an u -n I~ . Jn) e|. ( 9) 

We consider in more detail the ultrarelativistic case where the velocity of motion of the 
particle is close to the velocity of light. 

Setting v = c in the numerator of (74.2), we find that in the ultrarelativistic case the total 
intensity of the synchrotron radiation is proportional to the square of the particle energy &: 


2 4H? ó 2 
I= sa e (74.10) 


mc? 


The angular distribution of the radiation is highly anisotropic. The radiation is con- 
centrated mainly in the plane of the orbit. The "width" A0 of the angular range within 
which most of the radiation is included is easily evaluated from the condition 1 —(v?/c?) 
cos? 0 ~ 1 —(v?[c?), writing 0 = n/2- A0, sin 0 = 1 —(A0)?/2. It is clear thatt 


0S 2 
w~ fih e. (74.11) 
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We shall see below that in the ultrarelativistic case the main role in the radiation is played 
by frequencies with large n (Arzimovich and Pomeranchuk, 1945). We can therefore use 
the asymptotic formula (70.9), according to which: 
l 
JannE) = ——-- e[n'(1— £?)]. (74.12) 
Vrn? 
Substituting in (74.9), we get the following formula for the spectral distribution of the 
radiation for large values of n:1 


2 
2e*H? T2 ut » 
I=- aime {ow+ 5 Í upd}, (74.13) 
Tv 
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For 40 the function in the curly brackets approaches the constant limit 
'(0) = —0.4587...§ Therefore for u < 1, we have 


+ This result is, of course, in agreement with the angular distribution of the instantaneous intensity which 
we found in the preceding section [sce (73.12)]; however, the reader should not confuse the angle 8 of this 
section with the angle 0 between n and v in § 73! 

t In making the substitution, the limit n?? of the integral can be changed to infinity, to within the required 
accuracy; we have also set v = c wherever possible. Even though values of & close to 1 are important in the 
integral (74.9), the use of formula (74.12) is still permissible, since the integral converges rapidly at the lower 
limit. 

$ From the definition of the Airy function, we have: 


^ 31/87 (8) 
+13 gj Lm ~ 
J x sin x dx = Vn 
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4H? 242 E 3 
Lo (=) ni, pene (5) (74.14) 


2373 
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For u » ], we can use the asymptotic expression for the Airy function (see the footnote 
on p. 149, and obtain): 


H n! 23 
jx. cat Lm ey 74.15 
g 2/nm?c? P 3"Ve : PT mc? Uem 


that is, the intensity drops exponentially for large n. 
Consequently the spectrum has a maximum for 


e 3 
m): 


and the main part of the radiation is concentrated in the region of frequencies for which 


3 2 
Q ^ Og (<3) -= (4) : (74.16) 


m mc \mc 


These values of œw are very large, compared to the distance wy between neighboring fre- 
quencies. We may say that the spectrum has a ‘‘quasicontinuous” character, consisting of a 
large number of closely spaced lines. 
In place of the distribution function 7, we can therefore introduce a distribution over the 
continuous series of frequencies w = nog, writing 
do 
di = I,dn = I, —. 
Ox 
For numerical computations it is convenient to express this distribution in terms of the 
MacDonald function K,.t After some simple transformations of formula (74.13), it can be 
written as 


3 eH f 
dI = do y3eH F (2) FQ) =ë Í K,(dé, (74.17) 
2n mc We 
g 
where we use the notation 
2 
ie (5) ; (74.18) 
2mc \mc 


Figure 17 shows a graph of the function F(é). 


+ The connection between the Airy function and the function K;;; is given by formula (4) of the footnote 
on p. 149, In the further transformations one uses the recursion relations 


KiaQ)—Kii@)=—2 Ky 2KVG) = - KG) Krai, 
where K_,(x) = K,(x). In particular, it is easy to show that 


O(t)=— v Kan 6 en) 
az 
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Finally, a few comments on the case when the particle moves, not in a plane orbit, but in 
a helical trajectory, i.e. has a longitudinal velocity (along the field) v, = v cos y (where x 
is the angle between H and v). The frequency of the rotational motion is given by the same 
formula (74:1); but now the vector v moves, not in a circle, but on the surface of a cone with 
its axis along H and with vertex angle 27. The total intensity of the radiation (defined as the 
total energy loss per sec from the particle) will differ from (74.2) in having H replaced by 
H, = Hsing. 

In the ultrarelativistic case the radiation is concentrated in directions near the generators 
of the “velocity cone”. The spectral distribution and the total intensity (in the same sense as 
above) are obtained from (74.17) and (74.10) by the substitution H — H,. If we are talking 
about the intensity as seen in these directions by an observer at rest, then we must introduce 
an additional factor which takes into account the approach or moving away of the radiator 
(a particle moving in a circle). This factor is given by the ratio dt/dt,,,, where dto, is the 
time interval between arrival at the observer of signals emitted by the source at an interval 


dt. It is obvious that i 
diy. = a(1-; t| COS 9) 
c 


where 9 is the angle between the directions of k and H (the latter being taken as the positive 


direction of the velocity v,,). In the ultrarelativistic case, when the direction of k is close 
to the direction of v, we have $= y, so that 


a He og 
as Der cos x) x (74.19) 


PROBLEMS 


]. Find the law of variation of energy with time for a charge moving in a circular orbit in a 
constant uniform magnetic field, and losing energy by radiation. 
Solution: According to (74.2), we have for the energy loss per unit time: 


dê  2e*H? 
dt 3mic? 
& is the energy of the particle). From this we find: 


e t DELE - const 
-—,; = coth | ——-. 1-- cons 
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As t increases, the energy decreases monotonically, approaching the value 4 — mc? (for complete 
stopping of the particle) asymptotically as 1--> oc. 
2. Find the asymptotic formula for the spectral distribution of the radiation at large values of n 
for a particle moving in a circle with a velocity which is not close to the velocity of light. 
Solution: We use the well known asymptotic formula of the theory of Bessel functions 


J,(ue) Ie: En Ue E Ne evi-@ i 
vV2nn(1—c2))* EL: ; d 
which is valid for a(l —22)?:?: - ]. Using this formula, we find from (74.9): 
v 7 
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This formula is applicable for u[I— (c72/c2)]*? 7-1; if in addition 1 - (¢?/c*) is small, the formula 
goes over into (74.15). 

3. Find the polarization of the synchrotron radiation. 

Solution: The electric field E, is calculated from the vector potential A, (74.6-7) according to the 
formula 


E, = (AQ >k = ROA) CRAS. 


Let e;, e2 be unit vectors in the plane perpendicular to k, where e, is parallel to the x axis and e; 
lies in the »z plane (their components are e; = (1, 0, 0), c, — (0, sin 0, --cos 0); the vectors e;, ea 
and k form a right-hand system. Then the electric field will be: 

E, = ik A,, €i: ik Sin 0 Ay, es, 
or, dropping the unimportant common factors: 


to [Ht nt Y 
E, ~ - J, (= cos o) e; i tan 0J, (= cos ) ies. 
c c 


The wave is elliptically polarized (see 3 48). 
In the ultrarelativistic case, for large u and small angles 0, the functions J, and J; are expressed 
in terms of K, , and Kz., where we set 


QU o EA mE? 3 
l --— cos? U x I— e-( ) ; OF 
c 


in the arguments. As a result we get: 


^ "Ho. ] n. meN? | . 
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For 0 —0 the elliptical polarization degenerates into linear polarization along e;. For large 
6(|0| >> mc?/6, n0? >> 1), we have K,)4(x) Z Kaala) & V2/2xe77, and the polarization tends to 
become circular: E, ~ e; + ie;; the intensity of the radiation, however, also becomes exponentially 
small. In the intermediate range of angles the minor axis of the ellipse lies along eo and the major 
axis along e;. The direction of rotation depends on the sign of the angle 0 (0 >- O if the directions 
of H and k lie on opposite sides of the orbit planc, as shown in Fig. 16). 


§ 75. Radiation damping 


In § 65 we showed that the expansion of the potentials of the field of a system of charges 
in a series of powers of r/c leads in the second approximation to a Lagrangian completely 
describing (in this approximation) the motion of the charges. We now continue the expansion 
of the field to terms of higher order and discuss the effects to which these terms lead. 
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In the expansion of the scalar potential 


$- [aote 


the term of third order in 1/c is 


^3 
$0 = -L <. Í R?odV 
^. 6e at? =? 


§ 75 


(75.1) 


For the same reason as in the derivation following (65.3), in the expansion of the vector 


potential we need only take the term of second order in 1/c. that is, 


1280... 
Ama cA fjar. 


We make a transformation of the potentials: 


$ =¢ġ-- 5, A’ = A + grad f, 


choosing the function f so that the scalar potential ¢’° becomes zero: 


1 @ 
f= - aga | Rèeav. 
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Then the new vector potential is equal to 


1 ^ 1 ^42 
A?) = -35 iav- mU [ Rov 
180 f, | e 
=-a5 |i BEP RodV. 


(75.2) 


Making the transition from the integral to a sum over individual charges. we get for the 


first term on the right the expression 


lc. 
-aL e. 


In the second term, we write R = Ry —r, where Ry and r have their usual meaning (see § 66); 


then R = — t = —v and the second term takes the form 
i . 
38 Y ev. 
Thus, 


2 
AO = 3c Y ev. 


(75.3) 


The magnetic field corresponding to this potential is zero (H = curl A?’ = 0), since 


A’?) does not contain the coordinates explicitly. The electric field E = —(1/c)A” is 
2 
E--— 
303 d, (75.4) 


where d is the dipole moment of the system. 


§ 75 RADIATION DAMPING 207 


Thus the third order terms in the expansion of the field lead to certain additional forces 
acting on the charges, not contained in the Lagrangian (65.7); these forces depend on the 
time derivatives of the accelerations of the charges. 

Let us consider a system of charges carrying out a stationary motiont and calculate the 
average work done by the field (75.4) per unit time. The force acting on each charge e is 
f = eE, that is, 

2e . 
BE 
The work done by this force in unit time is f - v, so that the total work performed on all the 
charges is equal to the sum, taken over all the charges: 


yey wae 0553. ay 
Vive sd Lee Gdd= oa d- Say. 


When we average over the time, the first term vanishes, so that the average work is equal to 


f (75.5) 


NN 2 
MN Fi 
Xfv--ld" (75.6) 
The expression standing on the right is (except for a sign reversal) just the average energy 
radiated by the system in unit time [see (67.8)]. Thus, the forces (75.5) appearing in third 
approximation, describe the reaction of the radiation on the charges. These forces are called 
radiation damping or Lorentz frictional forces. 
Simultaneously with the energy loss from a radiating system of charges, there also occurs 
a certain loss of angular momentum. The decrease in angular momentum per unit time, 
aM/dt, is easily calculated with the aid of the expression for the damping forces. Taking the 
time derivative of the angular momentum M=Zrxp, we have M-Xrxj, since 
Li xp = m(vxy) =0. We replace the time derivative of the momentum of the particle 
by the friction force (75.5) acting on it, and find 


: 2 2 
M=} rxf=z; 2 erxd= z3 dxd. 


We are interested in the time average of the loss of angular momentum for a stationary 
motion, just as before, we considered the time average of the energy loss. Writing 


d ; ; 
dxd- 7 (dxd)-dxd 


and noting that the time derivative (first term) vanishes on averaging, we finally obtain the 
following expression for the average loss of angular momentum of a radiating system: ; 
< dxd. (75.1) 
[4 
Radiation damping occurs also for a single charge moving in an external field. It is 
equal to 
x s 
f=—,¥. (77,8) 
+ More precisely, a motion which, although it would have been stationary if radiation were neglected, 


proceeds with continual slowing down. 
1 In agreement with the result (3) obtained in problem 2 of s 72. 
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For a single charge, we can always choose such a system of reference that the charge at the 
given moment is at rest in it. If, in this reference frame, we calculate the higher terms in the 
expansion of the fizld produced by the charge, it turns out that they have the following 
property. As the radius vector R from the charge to the field point approaches zero, all these 
terms become zero. Thus in the case of a single charge, formula (75.8) is an exact formula for 
the reaction of the radiation, in the system of reference in which the charge is at rest. 

Nevertheless, we must keep in mind that the description of the action of the charge “‘on 
itself" with the aid of the damping force is unsatisfactory in general, and contains contradic- 
tions. The equation of motion of a charge, in the absence of an external field, on which only 
the force (75.8) acts, has the form 


2e? 


my =Ñ. 
3c? 


This equation has, in addition to the trivial solution v = const, another solution in which the 
acceleration v is proportional to exp(3mc?t/2e?), that is, increases indefinitely with the time. 
This means, for example, that a charge passing through any field, upon emergence from the 
field, would have to be infinitely "self-accelerated". The absurdity of this result is evidence for 
the limited applicability of formula (75.8). 

One can raise the question of how electrodynamics, which satisfies the law of conservation 
of energy, can lead to the absurd result that a free charge increases its energy without limit. 
Actually the root of this difficulty lies in the earlier remarks (8 37) concerning the infinite 
electromagnetic "intrinsic mass" of elementary particles. When in the equation of motion 
we write a finite mass for the charge, then in doing this we essentially assign to it formally an 
infinite negative "intrinsic mass" of nonelectromagnetic origin, which together with the 
electromagnetic mass should result in a finite mass for the particle. Since, however, the sub- 
traction of one infinity from another is not an entirely correct mathematical operation,this 
leads to a series of further difficulties, among which 15 the one mentioned here. 

In a system of coordinates in which the velocity of the particle is small, the equation of 
motion when we include the radiation damping has the form 

2e 


mi = E+ vxH 25 . (75.9) 
c 


From our discussion, this equation is applicable only to the extent that the damping force 
is small compared with the force exerted on the charge by the external field. 

To clarify the physical meaning of this condition, we proceed as follows. In the system of 
reference in which the charge is at rest at a given moment, the second time derivative of the 
velocity is equal, neglecting the damping force, to 


e. e 
V2-—E-T---Y*xH. 
m mc 


In the second term we substitute (to the same order of accuracy) + = (e/m)E, and obtain 


Corresponding to this, the damping force consists of two terms: 


2e . 2e* 
= —,E ExH. 75.10 
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If w is the frequency of the motion, then E is proportional to wE and, consequently, the 
first term is of order (e^w/mic?)E; the second is of order (e*/m?c*) EH. Therefore the condition 
for the damping force to be small compared with the force e£ exerted by the external field on 
the charge gives, first of all, 


or, introducing the wavelength 7 ~ c/w, 
e 
A» 


> (75.11) 
mc 

Thus formula (75.8) for the radiation damping is applicable only if the wavelength of the 
radiation incident on the charge is large compared with the "radius" of the charge e?/mc?. 
We see that once more a distance of order e?/nic? appears as the limit at which electro- 
dynamics leads to internal contradictions (see $ 37). 

Secondly, comparing the second term in the damping force to the force e£, we find the 
condition 

m?c* 


H & 


BC (75.12) 
Thus it is also necessary that the field itself be not too large. A field of order ni?^c*/e? also 
represents a limit at which classical electrodynamics leads to internal contradictions. Also 
we must remember here that actually, because of quantum effects, electrodynamics is 
already not applicable for considerably smaller fields.t 

To avoid misunderstanding, we remind the reader that the wavelength in (75.11) and the 
field value in (75.12) refer to the system of reference in which the particle is at rest at the 
given moment. 


PROBLEM 


Calculate the time in which two attracting charges, performing an elliptic motion (with 
velocity small compared with the velocity of light) and losing energy due to radiation, "fall in" 
toward each other. 

Solution: Assuming that the relative energy loss in one revolution is small, we can equate the 
time derivative of the energy to the average intensity of the radiation (which was determined in 
problem 1 of $ 70): 


a|e| Clee (e -&y (s- 365 " 


dt 36M? m, ms uc 


where a = |jeye4]. Together with the energy, the particles lose angular momentum. The loss of 
angular momentum per unit time is given by formula (75.7); substituting the expression (70.1) for 


d, and noting that uf = — ar/r? and M = ur x v, we find: 
dM — 2«(e e; VM 
d | 3gXWm mj r¥ 


We average this expression over a period of the motion. Because of the slowness of the changes in 
M, it is sufficient to average on the right only over r ^?; this average value is computed in precisely 
the same way as the average of r^ ^ was found in problem 1 of 3 70. As a result we find for the 


+ For fields of order m?c?/fe, i.e., when hw, ~ mc?. This limit is hc/e? = 137 times smaller than the limit set by 
the condition (75.12). The ratio of these distances to Ry is of order hc/e? ~ 137. 
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average loss of angular momentum per unit time the following expression: 
dM  2aQu|é|]y" [ei — e2 \? 
dt 39M? my m 


{as in equation (1), we omit the average sign]. Dividing (1) by (2), we get the differential equation 


dié| we (3-2 [ei Mf A, 


(2) 


dM — 2M? ux 
which, on integration, gives: 


2 43 Dor 
= esI) tu ya: (3) 
2M? Mi Mo 
The constant of integration is chosen so that for Af = Mo. we have & - £o. where Mo and & are 
the initial angular momentum and energy of the particles. 

The "falling in’ of the particles toward one another corresponds to M -»0. From (3) we sce 
that then 4 — — oc. 

We note that the product |4| M? tends toward 4/2?/2. and from formula (70.3) it is clear that the 
eccentricity e— 0, i.e. as the particles approach one another, the orbit approaches a circle. Sub- 
stituting (3) in (2), we determine the derivative dr/dM expressed as a function of M. after which 
integration with respect to M between the limits Mo and 0 gives the time of fall: 


OMS e > T Ro ; < " 
fran = a MEI (Nux. 2M ida) 7. 
av 2éqle? Vm me 


§ 76. Radiation damping in the relativistic case 


We derive the relativistic expression for the radiation damping (for a single charge), which 
is applicable also to motion with velocity comparable to that of light. This force is now a 
four-vector g', which must be included in the equation of motion of the charge, written in 
four-dimensional form: 
du e 


me- = zF utg" (76.1) 


To determine g° we note that for r < c. its three space components must go over into the 
components of the vector f/c (75.8). It is easy to see that the vector (2e7/3c)/(d?ui/ds?) 
has this property. However, it does not satisfy the identity giu; = 0. which is valid for any 
force four-vector. In order to satisfy this condition, we must add to the expression given, a 
certain auxiliary four-vector, made up from the four-velocity uê and its derivatives. The 
three space components of this vector must become zero in the limiting case v = 0, in order 
not to change the correct values of f which are already given by (2¢7/3c)/(d?u'/ds?). The 
four-vector u' has this property, and therefore the required auxiliary term has the form ai. 


The scalar « must be chosen so that we satisfy the auxiliary relation g'u; = 0. As a result we 
find 


"EU. (= T. Se) 
f = = uut —-}, (76.2) 
ds” 


In accordance with the equations of motion, this expression can be written in another form, 
by expressing d ?v'/ds? directly in terms of the field tensor of the external field acting on the 
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particle: 
du’ e pu 
— = Uy, 
ds me 
d^ e eF* e 


I ik I 
— u,u + ——F'UFu. 
ds? mc? ax! ^' m?c* M 
In making substitutions, we must keep in mind that the product of the tensor ĉF*/êx', 
which is antisymmetric in the indices i, k, and the symmetric tensor u; u, gives identically 
zero. So, 
2e? OF" 2e? 2e* 
— / il k 1 km i 
-—— — uu — — F" Fu" + —-GuuyF")u:. 76.3 
3mc? x! ^ 3m?c* H Ime | ue )( m) (76.3) 
The integral of the four-force g' over the world line of the motion of a charge, passing 
through a given field, must coincide (except for opposite sign) with the total four-momentum 
AP' of the radiation from the charge [just as the average value of the work of the force f in 
the nonrelativistic case coincides with the intensity of dipole radiation; see (75.6)]. It is 
easy to check that this is actually so. The first term in (76.2) goes to zero on performing the 
integration, since at infinity the particle has no acceleration. i.e. (du'/ds) = 0. We integrate 
the second term by parts and get: 


2e [ , dy 2e? fdu, du^ 
— t = —— i— d = m — —— d i 
[s'as 3c E g” 3c IE: ds ^ 


which coincides exactly with (73.4). 

When the velocity of the particle approaches the velocity of light, those terms in the space 
components of the four-vector (76.3) increase most rapidly which contain the components 
of the four-velocity in the third degree. Therefore, keeping only these terms in (76.3) and 
using the relation (9.18) between the space components of the four-vector g' and the 
three-dimensional force f, we find for the latter: 

2e* I km 
f = imic (Fuu XF RATS 
where n is a unit vector in the direction of v. Consequently. in this case the force f is opposite 
to the velocity of the particle; choosing the latter as the X axis, and writing out the four- 


dimensional expressions, we obtain: 


2e* (E,- HY M (E, HY 
0 3m?c* i v? 


c? 


f.- (76.4) 


(where we have set r = c everywhere except in the denominator). 

We see that for an ultrarelativistic particle, thc radiation damping is proportional to the 
square of its energy. 

Let us call attention to the following intcresting situation. Earlier we pointed out that the 
expression obtained for the radiation damping is applicable only to fields which (in the 
reference system Ko. in which the particle is at rest) are small compared with nice. Let 
F be the order of magnitude of the external field in the reference system A, in which the 
particle moves with velocity r. Then in the Ko frame, the field has the order of magnitude 
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FIV - eye (see the transformation formulas in § 24). Therefore F must satisfy the 
condition 3 
F 
et oal. (76.5) 


m?c* ]1— 


At the same time, the ratio of the damping force (76.4) to the external force (~ eF) is of 


the order of 3 
eF 


and we see that, even though the condition (76.5) is satisfied, it may happen (for sufficiently 
high energy of the particle) that the damping force is large compared with the ordinary 
Lorentz force acting on the particle in the electromagnetic field.t Thus for an ultrarelativistic 
particle we can have the case where the radiation damping is the main force acting on the 
particle. 

In this case the loss of (kinetic) energy of the particle per unit length of path can be 
equated to the damping force f, alone; keeping in mind that the latter is proportional to the 
square of the energy of the particle, we write 


do xin 
= k(x) Ein 
"dx ( ) kin 
where we denote by k(x) the coefficient, depending on the x coordinate and expressed in 
terms of the transverse components of the field in accordance with (76.4). Integrating this 
differential equation, we find 


x 


| I 
=— k(x) dx, 
Ekin So + Í 0) 5 


-20 


where é'y represents the initial energy of the particle (its energy for x ^ — œ). In particular, 


the final energy &, of the particle (after passage of the particle through the field) is given by 
the formula 


tx 
[D 0d kG)d 
e$) 5 | x) dx 


We see that for £o — oo, the final 6, approaches a constant limit independent of & 
(I. Pomeranchuk, 1939). In other words, after passing through the field, the energy of the 
particle cannot exceed the energy c'eri defined by the equation 


to 
1 


t We should emphasize that this result does not in any way contradict the derivation given earlier of the 
relativistic expression for the four-force g‘, in which it was assumed to be "small" compared with the four- 
force (e/c)F "ux. It is sufficient to satisfy the requirement that the components of one vector be small com- 
pared to those of another in just one frame of reference; by virtue of relativistic invariance, the four- 
dimensional formulas obtained on the basis of such an assumption will be valid in any other reference frame. 
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or, substituting the expression for A(x), 


tx 

_ 2 e à 

654 = o (s) f [(E,— H,Y +(E,+H,)}’]dx. (76.6) 
PROBLEMS 


1. Calculate the limiting energy which a particle can have after passing through the field of a 
magnetic dipole n; the vector m and the direction of motion lie in a plane. 

Solution: We choose the plane passing through the vector m and the direction of motion as the 
XZ plane, where the particle moves parallel to the X axis at a distance o from it. For the transverse 
components of the field of the magnetic dipole we have (see 44.4): 


H, — 0, 
3(m-r). — m,r? m . 21.3 
— n a ye (le cos @ +x sin $)e — (p? + x?) coso} 
(¢ is the angle between m and the Z axis). Substituting in (76.6) and performing the integration, 
we obtain 


H.= 


1 m?z e? \? 
E ;] (15--26 cos? ¢). 
Seri, 64m?cío? (za) (053 os 9) 
2. Write the three-dimensional expression for the damping force in the relativistic case. 
Solution: Calculating the space components of the four-vector (76.3), we find 


3 2N - 1/2 e h] 
f= 26 1-5 — +v: V E+ yx —-+v:VJH$+ 
3mc? c? et c et 


4 
2e {Ext are Be B) - 


T gmet 
2e 2 
- — x Y (gio) - B €». 
3m?c? (1-5) 
C 


§ 77. Spectral resolution of the radiation in the ultrarelativistic case 


Earlier (in $ 73) it was shown that the radiation from an ultrarelativistic particle is directed 
mainly in the forward direction, along the velocity of the particle: it is contained almost 
entirely within the small range of angles 


Ad~ Jl- 


around the direction of v. 

In evaluating the spectral resolution of the radiation, the relatton between the magnitude 
of the angular range A0 and the angle of deflection « of the particle in passing through the 
external electromagnetic field is essential. 

The angle « can be calculated as follows. The change in the transverse (to the direction of 
motion) momentum of the particle is of the order of the product of the transverse force eFt 


t If we choose the X axis along the direction of motion of the particle, then (eF)? is the sum of the squares 
of the y and z components of the Lorentz force, eE + ev/c xH, in which we can here set v z c: 


Fe= (E, — H) + (E,4+H,)*. 
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and the time of passage through the field, tf ~ a/v = a/c (where a is the distance within 

which the field is significantly different from 0). The ratio of this quantity to the momentum 
mo " mc 

Vi-vje Vi- 


determines the order of magnitude of the small angle a: 


p= 


eFa v? 
MES c? 
Dividing by A0, we find: 
a eFa 
A ~ me?" (77.1) 


We call attention to the fact that the ratio does not depend on the velocity of the particle, 
and is completely determined by the properties of the external field itself. 
We assume first that 
eFa > mc’, (77.2) 


that is, the total deflection of the particle is large compared with A0. Then we can say that 
radiation in a given direction occurs mainly from that portion of the trajectory in which the 
velocity of the particle is almost parallel to that direction (subtending with it an angle in the 
interval A0) and the length of this segment 1s small compared with a. The field F can be 
considered constant within this segment, and since a small segment of a curve can be con- 
sidered as an arc of a circle, we can apply the results obtained in § 74 for radiation during 
uniform motion in a circle (replacing H by F). In particular, we may state that the main part 
of the radiation is concentrated in the frequency range 


F 
w~ + (77.3) 
mc ( - 5) 
[see (74.16)]. 
In the opposite limiting case, 
eFa < mc’, (77.4) 


the total angle of deflection of the particle is small compared with A@. In this case the 
radiation is directed mainly into the narrow angular range A@ around the direction of motion, 
while radiation arrives at a given point from the whole trajectory. 

To compute the spectral distribution of the intensity, it 1s convenient to start in this case 
from the Lienard—Wiechert expressions (73.8) for the field in the wave zone. Let us compute 
the Fourier component 


2n 


b us 
Ec | Ee dt. 


The expression on the right of formula (73.8) is a function of the retarded time t’, which is 
determined by the condition t’ = 1—R(t’)/c. At large distances from a particle which is 
moving with an almost constant velocity v, we have: 


R I R ] 
Peg +-n r(t) = t— — + nevi 
c c c c 
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(r=r(t’) = vt' is the radius vector of the particle), or 


We replace the z integration by an integration over ¢’, by setting 


dt = dr ( — ==), 
C 


a 


and obtain: 
ikRo , nev 
E, = M UE Í nx {(»- ‘) xw) geri 5) gy. 
Cc n'y [4 
«e 
[4 


7o 


We treat the velocity v as constant; only the acceleration w(t’) is variable. Introducing the 


notation 
ie gi ( = ="), (77.5) 


and the corresponding frequency component of the acceleration, we write E, in the form 


bar ePo / wN? ( ae 
=- —) nx —-) xw,,>. 
°” e? Ro 2) n i J 2 


Finally from (66.9) we get for the energy radiated into solid angle do, with frequency in dw: 


e? /a\* Y yt. dw 
dE nw -z5() nx f(a- J Tw] do pn (77.6) 


An estimate of the order of magnitude of the frequencies in which the radiation is mainly 
concentrated in the case of (77.4) is easily made by noting that the Fourier component w, 
is significantly different from zero only if the time 1/o', or 


1 


s 
foi 
»(1- 5) 


is of the same order as the time a/v ~ a/c during which the acceleration of the particle 
changes significantly. Therefore we find: 


je (77.7) 


« (1-5) 


The energy dependence of the frequency is the same as in (77.3), but the coefficient is 
different. 

In the treatment of both cases (77.2) and (77.4) it was assumed that the total loss of energy 
by the particle during its passage through the field was relatively small. We shall now show 
that the first of these cases also covers the problem of the radiation by an ultrarelativistic 
particle, whose total loss of energy is comparable with its initial energy. 
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The total loss of energy by the particle in the field can be determined from the work of the 
Lorentz frictional force. The work done by the force (76.4) over the path ~ a is of order 


af~ 


e] iz 


the field must exist at distances 


F m? c* 


since otherwise we could not even apply ordinary electrodynamics. 


PROBLEMS 


1. Determine the spectral distribution of the total (over all directions) radiation intensity for the 
condition (77.2). 

Solution: For each element of length of the trajectory, the radiation is determined by (74.13), 
where we must replace H by the value of the transverse force F at the given point and, in addition, 
we must go over from a discrete to a continuous frequency spectrum. This transformation is accom- 
plished by formally multiplying by dn and the replacement 


dn dw 
l dn = l, —— do = 1, —. 
dw Wo 
Next, integrating over all time, we obtain the spectral distribution of the total radiation in the 


following form: 


dês = -do 22 (4.5 Lov) cL | odul dt 
n o ac A E GT. u) du ; 


where ®(z) is the Airy function of the argument 


The integrand depends on the integration variable ¢ implicitly through the quantity u (F and with 
it u, varies along the trajectory of the particle; for a given motion this variation can be considered 
as a time dependence). 
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2. Determine the spectral distribution of the total (over all directions) radiated energy for the 
condition (77.4). 

Solution: Keeping in mind that the main role is played by the radiation at small angles to the 
direction of motion, we write: 


02 3 32 
of =o (1~ 2008 eje (1-245) $ (1-3 +8). 


We replace the integration over angles do = sin 0d0 dé ~ 0d0 dé in (77.6) by an integration over 
d$ dw'[c. In writing out the square of the vector triple product in (77.6) it must be remembered that 
in the ultrarelativistic case the longitudinal component of the acceleration is small compared with 
the transverse component [in the ratio 1 — (c7/c?)], and that in the present case we can, to sufficient 
accuracy, consider w and v to be mutually perpendicular. As a result, we find for the spectral 
distribution of the total radiation the following formula: 


e 


e?o do wel? e v? o? v 
S Eu cert pec ql et 2 
dés 2nc? Í o? | o ( =) F 2t l c? do 


Z c 


§ 78. Scattering by free charges 


If an electromagnetic wave falls on a system of charges, then under its action the charges 
are set in motion. This motion in turn produces radiation in all directions; thereoccurs, we 
say, a scattering of the original wave. ; 

The scattering is most conveniently characterized by the ratio of the amount of energy 
emitted by the scattering system in a given direction per unit time, to the energy flux density 
of the incident radiation. This ratio clearly has dimensions of area, and is called the effective 
scattering cross-section (or simply the cross-section). 

Let d7 be the energy radiated by the system into solid angle do per second for an incident 
wave with Poynting vector S. Then the effective cross-section for scattering (into the solid 
angle do) is 


do = — (78.1) 


(the dash over a symbol means a time average). The integral o of do over all directions is the 
total scattering cross-section. 

Let us consider the scattering produced by a free charge at rest. Suppose there is incident 
on this charge a plane monochromatic linearly polarized wave. Its electric field can be 
written in the form 

E = E, cos (k r— wt +a). 

We shall assume that the velocity acquired by the charge under the influence of the incident 
wave is small compared with the velocity of light (which is usually the case). Then we can 
consider the force acting on the charge to be eE, while the force (e/c)v x H due to the mag- 
netic field can be neglected. In this case we can also neglect the effect of the displacement of 
the charge during its vibrations under the influence of the field. 1f the charge carries out 
vibrations around the coordinate origin, then we can assume that the field which acts on the 
charge at all times is the same as that at the origin, that is, 


E = E, cos (wt — a). 
Since the equation of motion of the charge is 
mi = eE 
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and its dipole moment d = er, then 


e 


d ——E. (78.2) 
m 
To calculate the scattered radiation, we use formula (67.7) for dipole radiation (this is 
justified, since the velocity acquired by the charge is assumed to be small). We also note that 
the frequency of the wave radiated by the charge (1.e., scattered by it) is clearly the same as 
the frequency of the incident wave. 


Substituting (78.2) in (67.7), we find 


4 


e 
dl = ——— (E xn’? do. 
Axnm?c? ( ) 


where n’ is a unit vector in the scattering direction. On the other hand, the Poynting vector 
of the incident wave is 


C 3 
S-—E. 78.3 
4n ( 


From this we find, for the cross-section for scattering into the solid angle do, 


2 2 
de = (5) sin? 0 do, (78.4) 
mc 


where 6 is the angle between the direction of scattering (the vector n), and the direction of 
the electric field E of the incident wave. We see that the effective scattering cross-section of a 
free charge is independent of frequency. 

We determine the total cross-section v. To do this, we choose the polar axis along E. 
Then do = sin 0 dð do; substituting this and integrating with respect to 0 from O to z, 


and over $ from 0 to 27, we find 
82 / e? \? 
o= i) (78.5) 
(This is the Thomson formula.) 

Finally, we calculate the differential cross-section do in the case where the incident wave 
is unpolarized (ordinary light). To do this we must average (78.4) over all directions of the 
vector E in a plane perpendicular to the direction of propagation of the incident wave 
(direction of the wave vector k). Denoting by e the unit vector along the direction of E, we 
write: 


sin? 6 = 1—(n-e)? = 1—n,nge,eg. 


The averaging is done using the formulat 


l kk 
DE (su - S), (78.6) 


and gives 
—— d (n-k)? 1 
sin? 0 =5( + o>) = aC +cos? ©) 


where O is the angle between the directions of the incident and scattered waves (the scattering 
angle). Thus the effective cross-section for scattering of an unpolarized wave by a free charge 


t In fact, e, eg is a symmetric tensor with trace equal to 1, which gives zero when multiplied by k., because 
e and k are perpendicular. The expression given here satisfies these conditions. 
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1S 


1 e 2 
do = ( =) (1 4- cos? ©) do, (78.7) 


2Xnc 


- 


The occurrence of scattering leads, in particular, to the appearance of a certain force 
acting on the scattering particle. One can verify this by the following considerations. On the 
average, in unit time, the wave incident on the particle loses energy c Wo, where W is the 
average energy density, and ø is the total effective scattering cross-section. Since the momen- 
tum of the field is equal to its energy divided by the velocity of light, the incident wave loses 
momentum equal in magnitude to We. On the other hand, in a system of reference in which 
the charge carries out only small vibrations under the action of the force eE, and its velocity 
v is small, the total flux of momentum in the scattered wave is zero, to terms of higher order 
in v/c (in § 73 it was shown that in a reference system in which r = 0, radiation of momentum 
by the particle does not occur). Therefore all the momentum lost by the incident wave is 
"absorbed" by the scattering particle. The average force f acting on the particle is equal to 
the average momentum absorbed per unit time, i.e. 


f=cWn, (78.8) 
(mg is a unit vector in the direction of propagation of the incident wave). We note that the 
average force appears as a second order quantity in the field of the incident wave, while the 
"instantaneous" force (the main part of which is eE) is of first order in the field. 
Formula (78.8) can also be obtained directly by averaging the damping force (75.10). The 
first term, proportional to É, goes to zero on averaging, as does the average of the main part 
of the force, eE. The second term gives 


2e* — JE (y= 


—|—n, 
mc?] 4n ” 


which, using (78.5), coincides with (78.8). 


PROBLEMS 


1. Determine the effective cross-section for scattering of an elliptically polarized wave by a free 
charge. 

Solution: The field of the wave has the form E = A cos (ct à) ! B sin (o£ o), where A and B 
are mutually perpendicular vectors (see § 48). By a derivation similar to the one in the text, we find 


e? NX (A~n)?! (Bx ny 
de = G5) A? B? do. 


2. Determine the effective cross-section for scattering of a linearly polarized wave by a charge 
carrying out small vibrations under the influence of an elastic force (oscillator). 
Solution: The equation of motion of the charge in the incident field E = Eo cos (ef — x) is 


" e ; 
for — Eocos(ot-- o), 
m 


where wo is the frequency of its free vibrations. For the forced vibrations, we then have 


. eEo cos (cot -- a) 


Calculating d from this, we find 
e? \? ot in? 0d 
do — (=) mao To sin? 0 do 


(8 is the angle between E and n^). 
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3. Determine the total effective cross-section for scattering of light by an electric dipole which, 
mechanically, is a rotator. The frequency «» of the wave is assumed to be large compared with the 
frequency Qo of free rotation of the rotator. 

Solution: Because of the condition w >> Qo, we can neglect the free rotation of the rotator, and 
consider only the forced rotation under the action of the moment of the forces d x E exerted on it 
by the scattered wave. The equation for this motion is: JQ — dX E, where J is the moment of 
inertia of the rotator and Q is the angular velocity of rotation. The change in the dipole moment 
vector, as it rotates without changing its absolute value, is given by the formula d = Qd. From 
these two equations, we find (omitting the quadratic term in the small quantity Q): 


4, 1 I 
d = 5 (dx E)x d = ; [Ed?—€ - d)d]. 


Assuming that all orientations of the dipole in space are equally probable, and averaging d? over 
them, we find for the total effective cross-section, 


| l6zd* 
(9c? 
4. Determine the degree of depolarization in the scattering of ordinary light by a free charge. 
Solution: From symmetry considerations, it is clear that the two incoherent polarized com- 
ponents of the scattered light (see $ 50) will be linearly polarized: one in the plane of scattering (the 
plane passing through the incident and scattered waves) and the other perpendicular to this plane. 
The intensities of these components are determined by the components of the field of the incident 
wave in the plane of scattering (E,) and perpendicular to it (E,), and, according to (78.4), are 
proportional respectively to 


(E, <n)? = Ejcos?O and (E, xn’)? = Ej 
(where O is the angle of scattering). Since for the ordinary incident light, E? = ET. the degree of 
depolarization [see the definition in (50.9)] is: 


[^3 


p — cos? ©. 


5. Determine the frequency «' of the light scattered by a moving charge. 
Solution: In a system of coordinates in which the charge is at rest, the frequency of thc light does 
not change on scattering (e = c»). This relation can be written in invariant form as 


k'u = kat, 


where u! is the four-velocity of the charge. From this we find without difficulty 


co’ (1— 20s e) =o ( — Ë cos 0), 
c c 


where 0 and 6’ are the angles made by the incident and scattered waves with the direction of motion 
(v is the velocity of the charge). 

6. Determine the angular distribution of the scattering of a linearly polarized wave by a charge 
moving with velocity v in the direction of propagation of the wave. 

Solution: The velocity of the particle is perpendicular to the fields E and H of the incident wave, 
and is therefore also perpendicular to the acceleration w given to the particle. The scattered intensity 
is given by (73.14), where the acceleration w of the particle must be expressed in terms of the fields 
E and H of the incident wave by the formulas obtained in the problem in § 17. Dividing the intensity 
dl by the Poynting vector of the incident wave, we get the following expression for the scattering 
cross-section: 

2 A2 


e v ((-3)(-3) Lae 
do - (25) i [(1-2sin 0 cos ) spen) cos? o| do, 


(1- ism 0 cos ¢ 
c 


where 0 and ¢ are the polar angle and azimuth of the direction n’ relative to a system of coordinates 
with Z axis along E, and X axis along v (cos (n', E) = cos 9; cos (n’, v) = sin 9 cos ¢). 
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7. Calculate the motion of a charge under the action of the average force exerted upon it by the 
wave scattered by it. 

Solution: The force (78.8), and therefore the velocity of the motion under consideration, is along 
the direction of propagation of the incident wave (X axis). In the auxiliary reference system Ko, 
in which the particle is at rest ( we recall that we are dealing with the motion averaged over the 
period of the small vibrations), the force acting on it is e Wọ, and the acceleration acquired by it 
under the action of this force is 


e — 
Wo =— Wo 
m 
(the index zero refers to the reference system Ko). The transformation to the original reference 


system K (in which the charge moves with velocity v) is given by the formulas obtained in the problem 
of § 7 and by formula (47.7), and gives: 


v 

d v ! do We c 
=., 2\3/2 fy mo n 

dt _ (1-5) dt mye 
c c c 


which determines the velocity v = dx/dt as an implicit function of the time (the integration con- 
stant has been chosen so that r = 0 at ¢ = 0). 


8. Determine the effective cross-section for scattering of a linearly polarized wave by an oscillator, 
taking into account the radiation damping. o , 
Solution: We write the equation of motion of the charge in the incident field in the form 
2e? _ 


In the damping force, we can substitute approximately F = — oùt; then we find 
e , 
P+ yi+ ogr = Ee, 
m 
where y = (2e?/3mc?)«$. From this we obtain 
e e` tut 
r= — Boa, 
m og — 07 — iwy 
The effective cross-section is 


8n/ e? V w’ 
6 = — | —; eC 
3 mc? (ói— oy + wy? 


§ 79. Scattering of low-frequency waves 


The scattering of a wave by a system of charges ditfers from the scattering by a single 
charge (at rest), first of all in the fact that because of the presence of internal motion of the 
charges of the system, the frequency of the scattered radiation can be different from the 
frequency of the incident wave. Namely, in the spectral resolution of the scattered wave 
there appear, in addition to the frequency w of the incident wave, frequencies w’ differing 
from w by one of the internal frequencies of motion of the scattering system. The scattering 
with changed frequency is called incoherent (or combinational), in contrast to the coherent 
Scattering without change in frequency. 
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Assuming that the field of the incident wave is weak, we can represent the current density 
in the form j = jọ +j’, where jo is the current density in the absence of the external field, and 
j is the change in the current under the action of the incident wave. Correspondingly, the 
vector potential (and other quantities) of the field of the system also has the form A =A +A’, 
where A, and A’ are determined by the currents jy and j’. Clearly, A’ describes the wave 
scattered by the system. 

Let us consider the scattering of a wave whose frequency w is small compared with all the 
internal frequencies of the system. The scattering will consist of an incoherent as well as a 
coherent part, but we shall here consider only the coherent scattering. 

In calculating the field of the scattered wave, for sufficiently low frequency w, we can use 
the expansion of the retarded potentials which was presented in §§ 67 and 71, even if the 
velocities of the particles of the system are not small compared with the velocity of light. 
Namely, for the validity of the expansion of the integral 


it is necessary only that the time r-n’/c~a/c be small compared with the time I/w; for 
sufficiently low frequencies (w < c/a), this condition is fulfilled independently of the velocities 
of the particles of the system. 

The first terms in the expansion give 


H’ = Ens (d' x n' +(i xn’) x n'j, 
where d', nv’ are the parts of the dipole and magnetic moments of the system which are 
produced by the radiation falling on the system. The succeeding terms contain higher time 
derivatives than the second, and we drop them. 
The component Hi’, of the spectral resolution of the field of the scattered wave, with 
frequency equal to that of the incident wave, is given by this same formula, when we sub- 


stitute for all quantities their Fourier components: d;, = —«d;,, ii, = —c?m/,. Then we 
obtain 5 
, e fm’ D r L , 
H, = iR x dj E n' x (mj, x nj. (79.1) 
C Ro 


The later terms in the expansion of the field would give quantities proportional to higher 
powers of the small frequency. If the velocities of all the particles of the system are small 
(v < c), then in (79.1) we can neglect the second term in comparison to the first, since the 
magnetic moment contains the ratio r/c. Then 


l 
Hu = oR, wn! x di,. (79.2) 

If the total charge of the system is zero, then for w — 0, d;, and m;, approach constant 
limits (if the sum of the charges were different from zero, then for w = 0, i.e. for a constant 
field, the system would begin to move as a whole). Therefore for low frequencies (w <v/a) 
we can consider d;, and nr, as independent of frequency, so that the field of the scattered 
wave is proportional to the square of the frequency. Its intensity is consequently propor- 
tional to w*. Thus for the scattering of a low-frequency wave. the effective cross-section for 
(coherent) scattering is proportional to the fourth power of the frequency of the incident 
radiation.t 

* This also applies to the scattering of light by ions as well as by neutral atoms. Because of the large mass 
of the nucleus, the scattering resulting from the motion of the ion as a whole can be neglected. 
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§ 80. Scattering of high-frequency waves 


We consider the scattering of a wave by a system of charges in the opposite limit, when the 
frequency w of the wave is large compared with the fundamental internal frequencies of the 
system. The latter have the order of magnitude wọ ~ v/a, so that w must satisfy the condition 


w> Wy ~=. (80.1) 


In addition, we assume that the velocities of the charges of the system are small (v < c). 

According to condition (80.1), the periods of the motion of the charges of the system are 
large compared with the period of the wave. Therefore during a time interval of the order 
of the period of the wave, the motion of the charges of the system can be considered uniform. 
This means that in considering the scattering of short waves, we need not take into account 
the interaction of the charges of the system with each other, that is, we can consider them as 
free. 

Thus in calculating the velocity v', acquired by a charge in the field of the incident wave, 
we can consider each of the charges in the system separately, and write for it an equation of 
motion of the form 


, 


m dv = eE = eE,e "n, 
dt 
where k = (w/c)n is the wave vector of the incident wave. The radius vector of the charge 
is, of course, a function of the time. In the exponent on the right side of this equation the 
time rate of change of the first term is large compared with that of the second (the first 
is œw, while the second is of order Av ~ v(w/c) < œ). Therefore in integrating the equation 
of motion, we can consider the term r on the right side as constant. Then 


e ; 
v = — — Ege Marek), (80.2) 

tom 
For the vector potential of the scattered wave (at large distances from the system), we have 


from (79.1): 


l 
A = CRo 3 (ev), - fe 


where the sum goes over all the charges of the system. Substituting (80.2), we find 


1 -io lt- Ro e? . 
iw : Lo p-idq'r 
icRow e ( “JE » m* ' (80.3) 


A'-- 


where q = k’—k is the difference between the wave vector k = (w/c)n of the incident wave, 
and the wave vector k' = (c/c)n' of the scattered wave.t The value of the sum in (80.3) 
must be taken at the time z’ = ¢—(R,/c) (for brevity as usual, we omit the index t’ on r); 
the change of rin the timer : n'/c can be neglected in view of our assumption that the velocities 
of the particles are small. The absolute value of the vector q is 


o 


w, 
q= 2- sin >? (80.4) 


where © is the scattering angle. 


+ Strictly speaking, the wave vector k' = co'n'/c, where the frequency w of the scattered wave may differ 
from w. However, in the present case of high frequencies the difference w — c ~ wo can be neglected. 
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For scattering by an atom (or molecule), we can neglect the terms in the sum in (80.3) 
which come from the nuclei, because their masses are large compared with the electron mass. 
Later we shall be looking at just this case, so that we remove the factor e?/m from the sum- 
mation sign, and understand by e and # the charge and mass of the electron. 

For the field H’ of the scattered wave we find from (66.3): 


E,xn _. C Ro) e? ; 
= — e 4 77]—5 e 7t, 80.5 
c Ro "m » ( ) 
The energy flux into an element of solid angle in the direction n' is 
H {2 4 2 
ciH | 5 e d 
R&do = ———, (nx E o 778775 do, 
8x ° 8nc?^n? ( oy De 


Dividing this by the energy flux clEo|?/87 of the incident wave, and introducing the angle 0 
between the direction of the field E of the incident wave and the direction of scattering, we 
finally obtain the effective scattering cross-section in the form 


2 
sin? 6 do. (80.6) 


2\2 
«(2 
mc 


Zear 


The dash means a time average, i.e. an average over the motion of the charges of the system; 
it appears because the scattering is observed over a time interval large compared with the 
periods of motion of the charges of the system. 

For the wavelength of the incident radiation, there follows from the condition (80.1) the 
inequality 4 « ac/v. As for the relative values of 4 and a, both the limiting cases 2 > a 
and A < a are possible. In both these cases the general formula (80.6) simplifies considerably. 

In the case of A > a, in the expression (80.6) q-r < 1, since q ~ 1/2, and r is of order of a. 
Replacing e~it" by unity in accordance with this, we have: 


Z 2X2 
o= (2) sin? Odo (80.7) 


that is, the scattering is proportional to the square of thc atomic number Z. 
We now go over to the case of 2 < a. In the square of the sum which appears in (80.6), in 
addition to the square modulus of each term, there appear products of the forme ^! * (' ^ *?», 
In averaging over the motion of the charges, i.e. over their mutual separations, F; —F3 
takes on values in an interval of order a. Since q ~ 1/4, à «a, the exponential factor 
e iv 01712 js a rapidly oscillating function in this interval, and its average value vanishes. 
Thus for 2 < a, the effective scattering cross-section is 


e? 2 
do =Z (=) sin? 0 do, (80.8) 


that is, the scattering is proportional to the first power of the atomic number. We note that 
this formula is not applicable for small angles of scattering (© ~ 4/a), since in this case 
q ^ 9/À ~ l/a and the exponent q-r is not large compared to unity. 

To determine the effective coherent scattering cross-section, we must separate out that 
part of the field of the scattered wave which has the frequency w. The expression (80.5) 
depends on the time through the factor e^/"*', and also involves the time in the sum 
Ee ''. This latter dependence leads to the result that in the field of the scattered wave there 
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are contained, along with the frequency w, other (though close to w) frequencies. That part 
of the field which has the frequency w (i.e. depends on the time only through the factor 
e~‘'), is obtained if we average the sum X e ^'*'" over time. In accordance with this, the 
expression for the effective coherent scattering cross-section do.o,, differs from the total 
cross-section do in that it contains, in place of the average value of the square modulus of 
the sum, the square modulus of thc average value of the sum, 


e? 2 
doon = | — ea 
h 
(2) X 


It is uscful to notc that this average value of the sum is (except for a factor) just the space 
Fourier component of the average distribution g(r) of the electric charge density in the 
atom: 


2 
sin? 0 do. (80.9) 


e Eev- [eee dV =Q,- (80.10) 
In case 4 > a, we can again replace e*'" by unity, so that 
e? 2 
do, = (z <=) sin? 0 do. (80.11) 
mc 


Comparing this with thc total effective cross-scction (80.7), we sce that dacon = doc, that is, 
all the scattering is coherent. 

If 2 <a, then when we average in (80.9) all the terms of the sum (being rapidly oscillating 
functions of the time) vanish, so that do,,, = 0. Thus in this case the scattering is completely 


incoherent. 


CHAPTER 10 


PARTICLE IN A GRAVITATIONAL FIELD 


§ 81. Gravitational fields in nonrelativistic mechanics 


Gravitational fields, or fields of gravity, have the basic property that all bodies move in 
them in the same manner, independently of mass, provided the initial conditions are the 
same. 

For example, the laws of free fall in the gravity field of the earth are the same for all 
bodies; whatever their mass, all acquire one and the same acceleration. 

This property of gravitational fields provides the possibility of establishing an analogy 
between the motion of a body in a gravitational field and the motion of a body not located 
in any external field, but which is considered from the point of view of a noninertial system 
of reference. Namely, in an inertial reference system, the free motion of all bodies is uniform 
and rectilinear, and if, say, at the initial time their velocities are the same, they will be the 
same for all times. Clearly, therefore, if we consider this motion in a given noninertial 
system, then relative to this system all the bodies will move in the same way. 

Thus the properties of the motion in a noninertial system are the same as those in an 
inertial system in the presence of a gravitational field. In other words, a noninertial reference 
system is equivalent to a certain gravitational field. This is called the principle of equivalence. 

Let us consider, for example, motion in a uniformly accelerated reference system. A body 
of arbitrary mass, freely moving in such a system of reference, clearly has relative to this 
system a constant acceleration, equal and opposite to the acceleration of the system itself. 
The same applies to motion in a uniform constant gravitational field, e.g. the field of gravity 
of the earth (over small regions, where the field can be considered uniform). Thus a uni- 
formly accelerated system of reference is equivalent to a constant, uniform external field. 
In the same way, nonuniformity accelerated linear motion of the reference system is clearly 
equivalent to a uniform but gravitational field. 

However, the fields to which noninertial reference systems are equivalent are not com- 
pletely identical with "actual" gravitational fields which occur also in inertial frames. For 
there is a very essential difference with respect to their behavior at infinity. At infinite 
distances from the bodies producing the field, "actual" gravitational fields always go to 
zero. Contrary to this, the fields to which noninertial frames are equivalent increase without 
limit at infinity, or, in any event, remain finite in value. Thus, for example, the centrifugal 
force which appears in a rotating reference system increases without limit as we move away 
from the axis of rotation; the field to which a reference system in accelerated linear motion is 
equivalent is the same over all space and also at infinity. 
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The fields to which noninertial systems are equivalent vanish as soon as we transform to 
an inertial system. In contrast to this, "actual" gravitational fields (existing also in an inertial 
reference frame) cannot be eliminated by any choice of reference system. This is already 
clear from what has been said above concerning the difference in conditions at infinity 
between “actual” gravitational fields and fields to which noninertial systems are equivalent; 
since the latter do not approach zero at infinity, it is clear that it is imposstble, by any 
choice of reference frame, to eliminate an “actual” field, since it vanishes at infinity. 

All that can be done by a suitable choice of reference system is to eliminate the gravita- 
tional field in a given region of space, sufficiently small so that the field can be considered 
uniform over it. This can be done by choosing a system in accelerated motion, the accelera- 
tion of which is equal to that which would be acquired by a particle placed in the region of 
the field which we are considering. 

The motion of a particle in a gravitational field is determined, in nonrelativistic mechanics, 
by a Lagrangian having (in an inertial reference frame) the form 


mv? 


L=- m$, (81.1) 


where 6 is a certain function of the coordinates and time which characterizes the field and 
is called the gravitational potential.t Correspondingly, the equation of motion of the particle 
iS 

y = — grad $. (81.2) 


It does not contain the mass or any other constant characterizing the properties of the 
particle; this is the mathematical expression of the basic property of gravitational fields. 


$ 82. The gravitational field in relativistic mechanics 


The fundamental property of gravitational fields that all bodies move in them in the same 
way, remains valid also in relativistic mechanics. Consequently there remains also the 
analogy between gravitational fields and noninertial reference systems. Therefore in studying 
the properties of gravitational fields in relativistic mechanics, we naturally also start from 
this analogy. 

In an inertial reference system, in cartesian coordinates, the interval ds is given by the 
relation: 

ds? = c*dt? - dx? - dy? - dz?. 


Upon transforming to any other inertial reference system (i.e. under Lorentz transforma- 
tion), the interval, as we know, retains the same form. However, if we transform to a non- 
inertial system of reference, ds? will no longer be a sum of squares of the four coordinate 
differentials. 

So, for example, when we transform to a uniformly rotating system of coordinates, 


Li 


X =x’ cos Qt — y' sin Qt, y = x' sin Qt4- y' cos Qt, Z=2 


(Q is the angular velocity of the rotation, directed along the Z axis), the interval takes on the 


+ In what follows we shall seldom have to use the electromagnetic potential ¢, so that the designation of 
the gravitational potential by the same symbol cannot lead to misunderstanding. 
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form 
ds? = [c? Q(x? + y jdt? — dx? - dy? — dz? +20y' dx' dt—2Qx' dy’ dt. 


No matter what the law of transformation of the time coordinate, this expression cannot be 
represented as a sum of squares of the coordinate differentials. 

Thus in a noninertial system of reference the square of an interval appears as a quadratic 
form of general type in the coordinate differentials, that is, it has the form 


ds? = g,dx'dx*, (82.1) 


where the gj, are certain functions of the space coordinates x!, x”, x? and the time co- 
ordinate x?. Thus, when we use a noninertial system, the four-dimensional coordinate 
system x°, x! , x?, x? is curvilinear. The quantities gą, determining all the geometric properties 
in each curvilinear system of coordinates, represent, we say, the space-time metric. 

The quantities g;, can clearly always be considered symmetric in the indices i and k 
(g,; = Gi), Since they are determined from the symmetric form (82.1), where g;, and g,; enter 
as factors of one and the same product dx! dx*. In the general case, there are ten different 
quantities g;,—four with equal, and 4-3/2 = 6 with different indices. In an inertial reference 
system, when we use cartesian space coordinates x! ?'? = x, y, z, and the time, x? = ct, 
the quantities g,, are 


do 7l 911=92=933=-1, gą=0 for i#k. (82.2) 


We call a four-dimensional system of coordinates with these values of g; galilean. 

In the previous section it was shown that a noninertial system of reference is equivalent 
to a certain field of force. We now see that in relativistic mechanics, these fields are deter- 
mined by the quantities gj. 

The same applies also to "actual" gravitational fields. Any gravitational field is just a 
change in the metric of space-time, as determined by the quantities g,,. This important fact 
means that the geometrical properties of space-time (its metric) are determined by physical 
phenomena, and are not fixed properties of space and time. 

The theory of gravitational fields, constructed on the basis of the theory of relativity, is 
called the general theory of relativity. It was established by Einstein (and finally formulated 
by him in 1915), and represents probably the most beautiful of all existing physical theories. 
It is remarkable that it was developed by Einstein in a purely deductive manner and only 
later was substantiated by astronomical observations. 

As in nonrelativistic mechanics, there is a fundamental difference between "actual" 
gravitational fields and fields to which noninertial reference systems are equivalent. Upon 
transforming to a noninertial reference system, the quadratic form (82.1), i.e. the quantities 
gi, are obtained from their galilean values (82.2) by a simple transformation of coordinates, 
and can be reduced over all space to their galilean values by the inverse coordinate trans- 
formation. That such forms for g,, are very special is clear from the fact that it is impossible 
by a mere transformation of the four coordinates to bring the ten quantities g,, to a pre- 
assigned form. 

An "actual" gravitational field cannot be eliminated by any transformation of co- 
ordinates. In other words, in the presence of a gravitational field space-time is such that the 
quantities g, determining its metric cannot, by any coordinate transformation, be brought 
to their galilean values over all space. Such a space-time is said to be curved, in contrast to 
flat space-time, where such a reduction is possible. 
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By an appropriate choice of coordinates, we can, however, bring the quantities gj to 
galilean form at any individual point of the non-galilean space-time: this amounts to the 
reduction to diagonal form of a quadratic form with constant coefficients (the values of gj, 
at the given point). Such a coordinate system is said to be galilean for the given point.t 

We note that, after reduction to diagonal form at a given point, the matrix of the quanti- 
ties gą has one positive and three negative principal values.t From this it follows, in par- 
ticular, that the determinant g, formed from the quantities g,,, is always negative for a real 
space-time: 

g <0. (82.3) 


A change in the metric of space-time also means a change in the purely spatial metric. 
To a galilean g, in flat space-time, there corresponds a euclidean geometry of space. In a 
gravitational field. the geometry of space becomes non-euclidean. This applies both to 
"true" gravitational fields. in which space-time is "curved", as well as to fields resulting 
from the fact that the reference system is non-inertial, which leave the space-time flat. 

The problem of spatial geometry in a gravitational field will be considered in more detail 
in § 84. It is useful to give here a simple argument which shows pictorially that space will 
become non-euclidean when we change to a non-inertial system of reference. Let us con- 
sider two reference frames. of which one (K) is inertial, while the other (K’) rotates uniformly 
with respect to K around their common c axis. A circle in the x, y plane of the K system 
(with its center at the origin) can also be regarded as a circle in the x’, »' plane of the K’ 
system. Measuring the length of the circle and its diameter with a yardstick in the K system, 
we obtain values whose ratio is equal to z, in accordance with the euclidean character of 
the geometry in the inertial reference system. Now let the measurement be carried out with 
a yardstick at rest relative to K’. Observing this process from the K system, we find that the 
yardstick laid along the circumference suffers a Lorentz contraction, whereas the yardstick 
placed radially is not changed. It is therefore clear that the ratio of the circumference to the 
diameter, obtained from such a measurement, will be greater than 7. 

In the general case of an arbitrary, varying gravitational field, the metric of space is not 
only non-euclidean, but also varies with the time. This means that the relations between 
different geometrical distances change with time. As a result, the relative position of “test 
bodies" introduced into the field cannot remain unchanged in any coordinate system.$ Thus 
if the particles are placed around the circumference of a circle and along a diameter, since 
the ratio of the circumference to the diameter is not equal to x and changes with time, it is 
clear that if the separations of the particles along the diameter remain unchanged the separa- 
tions around the circumference must change, and conversely. Thus in the general theory 
of relativity it is impossible in general to have a system of bodies which are fixed relative to 
one another. 

This result essentially changes the very concept of a system of reference in the general 
theory of relativity, as compared to its meaning in the special theory. In the latter we meant 


t To avoid misunderstanding. we state immediately that the choice of such a coordinate system does not 
mean that the gravitational field has been eliminated over the corresponding infinitesimal volume of four- 
space. Such an elimination is also always possible, by virtuc of the principle of equivalence, and has a greater 
significance (see § 87). 

+ This set of signs is called the signature of the matrix. 

§ Strictly speaking. the number of particles should be greater than four. Since we can construct a tetra- 
hedron from any six line segments, we can always, by a suitable definition of the reference system, make a 
system of four particles form an invariant tetrahedron. A fortiori, we can fix the particles relative to one 
another in systems of three or two particles. 
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by a reference system a set of bodies at rest relative to one another in unchanging relative 
positions. Such systems of bodies do not exist in the presence of a variable gravitational field, 
and for the exact determination of the position of a particle in space we must, strictly speak- 
ing, have an infinite number of bodies which fill all the space like some sort of medium". 
Such a system of bodies with arbitrarily running clocks fixed on them constitutes a reference 
system in the general theory of relativity. 

In connection with the arbitrariness of the choice of a reference system, the laws of nature 
must be written in the general theory of relativity in a form which is appropriate to any four- 
dimensional system of coordinates (or, as one says, in “covariant” form). This, of course, 
does not imply the physical equivalence of all these reference systems (like the physical 
equivalence of all inertial reference systems in the special theory). On the contrary, the 
specific appearances of physical phenomena, including the properties of the motion of 
bodies, become different in all systems of reference. 


§ 83. Curvilinear coordinates 


Since, in studying gravitational fields we are confronted with the necessity of considering 
phenomena in an arbitrary reference frame, it is necessary to develop four-dimensional 
geometry in arbitrary curvilinear coordinates. Sections 83, 85 and 86 are devoted to this. 

Let us consider the transformation from one coordinate system, x9, x! , x?, x3, to another 

70 ʻi 12 3. 
NK XX 
xi = fx'9, xt. x", x'3), 
where the f' are certain functions. When we transform the coordinates, their differentials 
transform according to the relation 
Óx i 
dxi = ax’ dx'*. (83.1) 

Every aggregate of four quantities A‘ (i = 0, 1, 2, 3), which under a transformation of 

coordinates transform like the coordinate differentials, is called a contravariant four-vector: 
Aim DE an (83.2) 
an^ 3.2 

Let ġ be some scalar. Under a coordinate transformation, the four quantities ĉġ/êx' 

transform according to the formula 

ô 6d Ox'* 

a L (83.3) 

ax! Ox" Ox 
which is different from formula (83.2). Every aggregate of four quantities A; which, under a 
coordinate transformation, transform like the derivatives of a scalar, is called a covariant 
four-vector: 

Ox. 

A; = “ax! Ax. (83.4) 

Because two types of vectors appear in curvilinear coordinates, there are three types of 
tensors of the second rank. We call a contravariant tensor of the second rank, A‘, an aggregate 
of sixteen quantities which transform like the products of the components of two contra- 
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variant vectors, i.e. according to the law 


ik XE óx* "a (83.5) 
~ Ox' Qx'" 
A covariant tensor of rank two, transforms according to the formula 
Ax = = = Aims (83.6) 
CX OX 
and a mixed tensor transforms as follows: 
Ai = ud zm AS. (83.7) 
x" ex 


The definitions given here are the natural generalization of the definitions of four-vectors 
and four-tensors in galilean coordinates (§ 6), according to which the differentials dx! 
constitute a contravariant four-vector and the derivatives 6¢/éx' form a covariant four- 
vector.T 

The rules for forming four-tensors by multiplication or contraction of products of other 
four-tensors remain the same in curvilinear coordinates as they were in galilean coordinates. 
For example, it is easy to see that, by virtue of the transformation laws (83.2) and (83.4), the 
scalar product of two four-vectors A'B, is invariant: 

im 
AB, = 2 A'B, = -27 -A'B = A"B;, 

The unit four-tensor 6} is defined the same as before in curvilinear coordinates: its 
components are again 6, = 0 for i # k, and are equal to | for i = k. If A* isa four- -vector, 
then multiplying by 5j we get: 

A‘, = Al, 
i.e. another four-vector; this proves that 6; is a tensor. 

The square of the line element in curvilinear coordinates is a quadratic form in the 
differentials dx': 


ds? = g, dx'dx*, (83.8) 
where the g are functions of the coordinates; gix is symmetric in the indices i and k: 
ik = Jri (83.9) 


Since the (contracted) product of g,, and the contravariant tensor dx! dx* is a scalar, the 
gu, form a covariant tensor; it is called the metric tensor. 
Two tensors A; and B* are said to be reciprocal to each other if 


Aik B" = ól. 
In particular the contravariant metric tensor is the tensor g* reciprocal to the tensor gj, 


that is, 
gag" = ôi. (83.10) 


The same physical quantity can be represented in contra- or co-variant components. It is 
obvious that the only quantities that can determine the connection between the different 


+ Nevertheless, while in a galilean system the coordinates x' themselves (and not just their differentials) 
also form a four-vector, this is, of course, not the case in curvilinear coordinates. 
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forms are the components of the metric tensor. This connection is given by the formulas: 
Ai=g"A,, A, -944* (83.11) 
In a galilean coordinate system the metric tensor has components: 


1 0 0 0 


; 0 -1 0 0 
gi = gi = 0 0 i ol (83.12) 
0 0 0 -1 


Then formula (83.11) gives the familiar relation 4? = Ay, A} ^? = —A, 2 3, ete.t 
These remarks also apply to tensors. The transition between the different forms of a given 
physical tensor is accomplished by using the metric tensor according to the formulas: 


Ai, = gi! Au, Aik = gg" Aims 
etc, 

In § 6 we defined (in galilean coordinates) the completely antisymmetric unit pseudo- 
tensor e'*7, Let us transform it to an arbitrary system of coordinates, and now denote it by 
E". We keep the notation e'"" for the quantities defined as before by e°!?? = | (or 
0123 = 7 1). . 

Let the x" be galilean, and the x' be arbitrary curvilinear coordinates. According to the 
general rules for transformation of tensors, we have: 


ax! de ix ien, 


ikim _ 0x OX 
Ex’? Ax’? Ox" Ox" , 


or 
En = Je”, 


where J is the determinant formed from the derivatives @x'/@x’?, i.e. it is just the Jacobian 
of the transformation from the galilean to the curvilinear coordinates: 
A(x°, x!, x?, x3) 


J=- a, 
O(x'®, x'1, x'2, x3) 


This Jacobian can be expressed in terms of the determinant of the metric tensor g; (in the 
system x^). To do this we write the formula for the transformation of the metric tensor: 


i k 
ik Ox" Ox" Im(0) 
Ox" Qx'" ` 
and equate the determinants of the two sides of this equation. The determinant of the re- 
ciprocal tensor |g'*| = 1/g. The determinant |g/^'?| = —1. Thus we have I/g = —J?, and 


soJ- VN =g. 
Thus, in curvilinear coordinates the antisymmetric unit tensor of rank four must be 
defined as 


; 1 , 
En = ei^. (83.13) 
—9 


1 Whenever, in giving analogies, we use galilean coordinate systems, one should realize that such a 
system can be selected only in a flat space. In the case of a curved four-space, one should speak of a co- 
ordinate system that is galilean over a given infinitesimal element of four-volume, which can always be 
found. None of the derivations are affected by this change. 
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The indices of this tensor are lowered by using the formula 


prst = 
e SipBirÜis m = — 0 €um 


so that its covariant components are 
P 
Eis = N ZI eisim (83.14) 


In a galilean coordinate system x^ the integral of a scalar with respect to 
dQ! = dx'? dx'! dx'? dx" is also a scalar, i.e. the element dQ’ behaves like a scalar in 
the integration (8 6). On transforming to curvilinear coordinates x’, the element of integra- 
tion dQ’ goes over into 


A—— 


l i 
dQ’ ,dQ= . —gdQ. 


Thus, in curvilinear coordinates, when integrating over a four-volumc the quantity v — gd Q 
behaves like an invariant. 

All the remarks at the end of § 6 concerning elements of integration over hypersurfaces, 
surfaces and lines remain valid for curvilinear coordinates, with the one difference that the 
definition of dual tensors changes. The element of tarea” of the hypersurface spanned by 
three infinitesimal displacements is the contravariant antisymmetric tensor dS“; the vector 


dual to it is gotten by multiplying by the tensor J-g €iig» SO It Is equal to 
JV -gdS; = — Lenin d SH! =g. (83.15) 


Similarly, if df is the element of (two-dimensional) surface spanned by two infinitesimal 
displacements, the dual tensor is defined ast 


V -g dfa = lv ~9 egi df (83.16) 


We keep the designations dS; and df,* as before for | ej, d$" and dei, d/"" (and 


not their products by 4-9; the rules (6.14-19) for transforming the various integrals 
into one another remain the same, since their derivation was formal in character and not 
related to the tensor properties of the different quantities. Of particular importance is the 
rule for transforming the integral over a hypersurface into an integral over a four-volume 
(Gauss' theorem), which is accomplished by the substitution: 


dS; ^ dQ E. (83.17) 
Ux 


+ H ¢ is a scalar, the quantity V -—g ¢, which gives an invariant when integrated over dQ, is called a 
scalar density. Similarly, we speak of vector and tensor densities V —g A', V --g A'*, etc. These quantities 
give a vector or tensor on multiplication by the four-volume clement dQ (the integral f A‘ V —g dQ over 
a finite region cannot, generally speaking, be a vector, since the laws of transformation of the vector 4' are 
different at different points). 

t It is understood that the elements dS *'" and df are constructed on the infinitesimal displacements 
dx', dx", dx"! in the same way as in § 6. no matter what the geometrical significance of the coordinates x'. 
Then the formal significance of the elements dS; and df}, is the same as before. In particular, as before 
dSo = dx, dx; dx; = dV. We keep the earlier definition of dV for the product of differentials of the three 
space coordinates; we must, however, remember that the element of geometrical spatial volume is given in 
curvilinear coordinates not by dV, but by * 7 dV, where y is the determinant of the spatial metric tensor 
(which will be defined in the next section). 
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§ 84. Distances and time intervals 


We have already said that in the general theory of relativity the choice of a coordinate 
system is not limited in any way; the triplet of space coordinates x', x, x°, can be any 
quantities defining the position of bodies in space, and the time coordinate x? can be 
defined by an arbitrarily running clock. The question arises of how, in terms of the values 
of the quantities x!, x?, x°, x°, we can determine actual distances and time intervals. 

First we find the relation of the proper time, which from now on we shall denote by r. 
to the coordinate x?. To do this we consider two infinitesimally separated events, occurring 
at one and the same point in space. Then the interval ds between the two events is, as we 
know, just edt, where dt is the (proper) time interval between the two events. Setting 
dx! = dx? = dx? = Q in the general expression ds? = gi dx! dx’, we consequently find 


ds? = cà dt? = gool(dx°)’, 
from which 


1 
dt= v good, (84.1) 
c 


or else, for the time between any two events occurring at the same point in space, 


1 
t=. Í V goo d x9. (84.2) 


This relation determines the actual time interval (or as it is also called, the proper time 


for the given point in space) for a change of the coordinate x°. We note in passing that the 
quantity goo, as we see from these formulas, is positive: 


Joo > 0. (84.3) 


It is necessary to emphasize the difference between the meaning of (84.3) and the meaning 
of the signature [the signs of three principal values of the tensor gj (§ 82)]. A tensor gj, 
which does not satisfy the second of these conditions cannot correspond to any real gravita- 
tional field, i.e. cannot be the metric of a real space-time. Nonfulfilment of the condition 
(84.3) would mean only that the corresponding system of reference cannot be reaiized with 
real bodies; if the condition on the principal values is fulfilled, then a suitable transforma- 
tion of the coordinates can make go, positive (an-example of such a system is given by the 
rotating system of coordinates, see $ 89). 

We now determine the element d/ of spatial distance. 1n the special theory of relativity 
we can define d/ as the interval between two infinitesimally separated events occurring at 
one and the same time. In the general theory of relativity, it is usually impossible to do this, 
i.e. it is impossible to determine d! by simply setting dx? = 0 in ds. This is related to the fact 


that in a gravitational field the proper time at different points in space has a different 
dependence on the coordinate x°. 


To find d/, we now proceed as follows. 

Suppose a light signal is directed from some point B in space (with coordinates x? + dx?) 
to a point À infinitely near to it (and having coordinates x*) and then back over the same 
path. Obviously, the time (as observed from the one point B) required for this, when 
multiplied by c, is twice the distance between the two points. 

Let us write the interval, separating the space and time coordinates: 


ds? = g p dx? dx"  2go, dx? dx? goo (dx?) (84.4) 
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where it is understood that we sum over repeated Greek indices from | to 3. The interval 
between the events corresponding to the departure and arrival of the signal from one point 
to the other is equal to zero. Solving the equation ds? = 0 with respect to dx°, we find two 
roots: 


i : 

dx? = Gor (8o. dx* — (85005 — Jap Joo) dx? dx}, 
W eee ee (84.5) 

ay te ae C7 go4 dX* + N (804805 — 045800) dX^ dx}, 
00 


corresponding to the propagation of the signal in the two directions between A and B. If x? 
is the moment of arrival of the signal at A, the times when it left B and when it will return 
to B are, respectively, x?-Fdx$) and x°+dx‘?). In the schematic diagram of Fig. 18 the 
solid lines are the world lines corresponding to the given coordinates x* and x*+dx*, while 
the dashed lines are the world lines of the signals.t It is clear that the total interval of "time" 
between the departure of the signal and its return to the original point is equal to 


2 PUE NN EU 
dx?? - dx) = — V(9o.op— Gap Goo) dx" dx". 


Joo 
(2) 
x? dx, 
w^. 
"d 
t 
x? 
N 
^w 
Low (1) 
Mx dx, 
A B 
Fic. 18 


The corresponding interval of proper time is obtained, according to (84.1), by multiplying 


by V goolts and the distance d/ between the two points by multiplying once more by c/2. As 
a result, we obtain 


00 y 


This is the required expression, defining the distance in terms of the space coordinate 
elements. We rewrite it in the form 


dl? = Y, dx* dx*, (84.6) 
where 
rea = (7o n) (84.7) 
Joo 


t In Fig. 18, it is assumed that dx(? > 0, dx$? « 0, but this is not necessary: dx*? and dx°? may have 
the same sign. The fact that in this case the value x°(A) at the moment of arrival of the signal at A might 
be less than the value x*(B) at the moment of its departure from B contains no contradiction, since the 
rates of clocks at different points in space are not assumed to be synchronized in any way. 
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is the three-dimensional metric tensor, determining the metric, i.e., the geometric properties 
of the space. The relations (84.7) give the connection between the metric of real space and 
the metric of the four-dimensional space-time.t 

However, we must remember that the g;, generally depend on x°, so that the space metric 
(84.6) also changes with time. For this reason, it is meaningless to integrate d/; such an 
integral would depend on the world line chosen between the two given space points. Thus, 
generally speaking, in the general theory of relativity the concept of a definite distance 
between bodies loses its meaning, remaining valid only for infinitesimal distances. The only 
case where the distance can be defined also over a finite domain is that in which the gx 
do not depend on the time, so that the integral f d/ along a space curve has a definite 
meaning. 

It is worth noting that the tensor — y, is the reciprocal of the contravariant three-dimen- 
sional tensor g”. In fact, from g'*g,, = ôi, we have, in particular, 


ggg, +g go, = 05, — g""ggo-g' goo — 0, — g^ gs g^" go = 1. (84.8) 
Determining g*? from the second equation and substituting in the first, we obtain: 
=g Ys, = 95. 


This result can be formulated differently, by the statement that the quantities — g*? form 
the contravariant three-dimensional metric tensor corresponding to the metric (84.6): 


y? = =g”. (84.9) 
We also state that the determinants g and y, formed respectively from the quantities g;, 
and Yap, are related to one another by 


=g = gooY- (84.10) 
In some of the later applications it will be convenient to introduce the three-dimensional 
vector g, whose covariant components are defined as 
g= — 228. (84.11) 
Joo 
Considering g as a vector in the space with metric (84.6), we must define its contravariant 


components as g* = y^ 95. Using (84.9) and the second of equations (84.8), it is easy to see 
that 


g= gg = —g™. (84.12) 
We also note the formula 
' . 1 
g°° = — —9,9°, (84.13) 
Joo 


which follows from the third of equations (84.8). 


t The quadratic form (84.6) must clearly be positive definite. For this, its coefficients must, as we know 
from the theory of forms, satisfy the conditions 
ly Yu! iyii 12 71a! 
231 > 0, SD £2 0, yn Yo2 Yas, > 0. 
[Yaa Yaa! i "E 
‘Yar Yaz Yaa 
Expressing y,, in terms of g, it is easy to show that these conditions take the form 


Goo gos! ‘goo goi goz! 
gio gui S> 919 9n 92 >0,  g«0. 
Mu lg2o 21 922i 


These conditions, together with the condition (84.3), must be satisfied by the components of the metric 
tensor in every system of reference which can be realized with the aid of real bodies. 
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We now turn to the definition of the concept of simultaneity in the general theory of 
relativity. In other words, we discuss the question of the possibility of synchronizing clocks 
located at different points in space, i.e. the setting up of a correspondence between the 
readings of these clocks. 

Such a synchronization must obviously be achieved by means of an exchange of light 
signals between the two points. We again consider the process of propagation of signals 
between two infinitely near points A and B, as shown in Fig. 18. We should regard as simul- 
taneous with the moment x? at the point A that reading of the clock at point B which is half- 
way between the moments of departure and return of the signal to that point, i.e. the moment 


x? HAX? = x? Hdx 4 dx), 


Substituting (84.5), we thus find that the difference in the values of the “time” x? for two 
simultaneous events occurring at infinitely near points is given by 


go, dx^ 
Joo 

This relation enables us to synchronize clocks in any infinitesimal region of space. Carry- 
ing out a similar synchronization from the point A, we can synchronize clocks, i.e. we can 
define simultaneity of events, along any open curve.t 

However, synchronization of clocks along a closed contour turns out to be impossible in 
general. In fact. starting out along the contour and returning to the initial point, we would 
obtain for Ax? a value different from zero. Thus it is, a fortiori, impossible to synchronize 
clocks over all space. The exceptional cases are those reference systems in which all the 
components go, are equal to zero. 

It should be emphasized that the impossibility of synchronization of all clocks is a property 
of the arbitrary reference system, and not of the space-time itself. In any gravitational field, 
it is always possible (in infinitely many ways) to choose the reference system so that the three 
quantities go, become identically equal to zero, and thus make possible a complete synchro- 
nization of clocks (see $ 97). 

Even in the special thcory of relativity, proper time elapses differently for clocks moving 
relative to one another. In the general theory of relativity, proper time 'elapses differently 
even at different points of space in the same reference system. This means that the interval 
of proper time between two events occurring at some point in space, and the interval of 
time between two events simultaneous with these at another point in space, are in general 
different from one another. 


Ax? = = g,dx". (84.14) 


§ 85. Covariant differentiation 


In galilean coordinates§ the differentials d4; of a vector A; form a vector, and the 
derivatives ¢4,/Ex* of the components of a vector with respect to the coordinates form a 
tensor. In curvilinear coordinates this is not so; dA; is not a vector, and 04,/0x* is not a 


t Multiplying (84.14) by goo and bringing both terms to one side, we can state the condition for syn- 
chronization in the form dxo = go; dx! = 0: the “covariant differential” dxo between two infinitely near 
simultaneous events must be equal to zero. 

+ We should also assign to this class those cases where the go, can be made equal to zero by a simple 
transformation of the time coordinate, which does not involve any choice of the system of objects serving 
for the definition of the space coordinates. 

$ In general, whenever the quantities g,, are constant. 
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tensor. This is due to the fact that dA, is the difference of vectors located at different (in- 
finitesimally separated) points of space; at different points in space vectors transform 
differently, since the coefficients in the transformation formulas (83.2), (83.4) are functions 
of the coordinates. 

It is also easy to verify these statements directly. To do this we determine the transforma- 
tion formulas for the differentials dA; in curvilinear coordinates. A covariant vector is 
transformed according to the formula 


4 Lr uu 
Hind 6x! k> 
therefore 
Ox'* ox’  ox'* Q* 
dA, 2 —4 dA} + Aid — = — dAL A, —— dx. 
xt ^U axt xi OORT Gxt ay! 


Thus dA, does not transform at all like a vector (the same also applies, of course, to the 
differential of a contravariant vector). Only if the second derivatives 0?x’*/éx'éx! = 0, i.e. 
if the x'* are linear functions of the x*, do the transformation formulas have the form 

Óx'* 
that is, dA; transforms like a vector. 

We now undertake the definition of a tensor which in curvilinear coordinates plays the 
same role as ĝA;/ôx" in galilean coordinates. In other words, we must transform €4,/éx* 
from galilean to curvilinear coordinates. 

In curvilinear coordinates, in order to obtain a differential of a vector which behaves like 
a vector, it is necessary that the two vectors to be subtracted from each other be located at 
the same point in space. In other words, we must somehow "'translate" one of the vectors 
(which are separated infinitesimally from each other) to the point where the second is 
located, after which we determine the difference of two vectors which now refer to one and 
the same point in space. The operation of translation itself must be defined so that in galilean 
coordinates the difference shall coincide with the ordinary differential dA;. Since dA; is just 
the difference of the components of two infinitesimally separated vectors, this means that 
when we use galilean coordinates the components of the vector should not change as a result 
of the translation operation. But such a translation is precisely the translation of a vector 
parallel to itself. Under a parallel translation of a vector, its components in galilean co- 
ordinates do not change. If, on the other hand, we use curvilinear coordinates. then in 
general the components of the vector will change under such a translation. Therefore in 
curvilinear coordinates, the difference in the components of the two vectors after translating 
one of them to the point where the other is located will not coincide with their difference 
before the translation (i.e. with the differential dA;). 

Thus to compare two infinitesimally separated vectors we must subject one of them to a 
parallel translation to the point where the second is located. Let us consider an arbitrary 
contravariant vector; if its value at the point x' is 4’, then at the neighboring point x' 4- dx! 
it is equal to 4'-- dA'. We subject the vector 4’ to an infinitesimal parallel displacement to 
the point x'4- dx; the change in the vector which results from this we denote by 54’. Then 
the difference DA' between the two vectors which are now located at the same point is 


DA! = dA! —ó4A'. (85.1) 
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The change 64‘ in the components of a vector under an infinitesimal parallel displacement 
depends on the values of the components themselves, where the dependence must clearly be 
linear. This follows directly from the fact that the sum of two vectors must transform accord- 
ing to the same law as each of the constituents. Thus 54‘ has the form 

ó4! = -TiA* dx, (85.2) 
where the T}, are certain functions of the coordinates. Their form depends, of course, on the 
coordinate system; for a galilean coordinate system Tj, = 0. 

From this it is already clear that the quantities T$, do not form a tensor, since a tensor 
which is equal to zero in one coordinate system is equal to zero in every other one. In a 
curvilinear space it is, of course, impossible to make all the T}, vanish over all of space. 

But the principle of equivalence requires that by a suitable choice of coordinate system 
we can eliminate the gravitational field over a given infinitesimal region of space, i.e., we 
can make the quantities I$, vanish in it. We shall see later in $87 that the I", play the role 
of field strengths.t 

The quantities T}, are called “connection coefficients" or “Christoffel symbols". 

In addition to the quantities Ij, we shall later also use quantities I';,;1 defined as follows: 

D; ai = Iim Tir (85.3) 
Conversely, 
D- g”T mar (85.4) 

It is also easy to relate the change in the components of a covariant vector under a parallel 
displacement to the Christoffel symbols. To do this we note that under a parallel displace- 
ment, a scalar is unchanged. In particular, the scalar product of two vectors does not change 
under a parallel displacement. 

Let 4, and B be any covariant and contravariant vectors. Then from ò(4; B’) = 0, we 
have 


7 M 7 Bi 6A; = — A; ôB = Ti, B*A; dx! 
or, changing the indices, 


Bi dA, = ThA, Bi dx. 
From this, in view of the arbitrariness of the B', 
ôA; = ThA, dx! (85.5) 
which determines the change in a covariant vector under a parallel displacement. 
Substituting (85.2) and dA! = (0A'[Ox!) dx! in (85.1), we have 


. [ðA —(2 
DA‘ = (S + ") dx! (85.6) 
Cx 
Similarly, we find for a covariant vector, 
^A. 
DA, = E = ria) dx'. (85.7) 


The expressions in parentheses in (85.6) and (85.7) are tensors, since when multiplied by 
the vector dx' they give a vector. Clearly, these are the tensors which give the desired 
generalization of the concept of a derivative to curvilinear coordinates. These tensors are 
called the covariant derivatives of the vectors A‘ and A; respectively. We shall denote them 
by A*., and A; Thus, 

DA! = A. dx'; DA; = Aj, dx, (85.8) 

+ This is precisely the coordinate system which we have in mind in arguments where we, for brevity's sake, speak 


of a “galilean” system; still all the proofs remain applicable not only to flat, but also to curved 4-space. 


1 In place of Tf, and I,,,,, the symbols Uu and I] are sometimes used. 


240 PARTICLE IN A GRAVITATIONAL FIFLD § 85 


while the covariant derivatives themselves are: 


. GA! ; 
A, 2 TuS, (85.9) 
. Ox 
JA; 
Ani = = ThA (85.10) 
CN 


In galilean coordinates, Pj, = 0, and covariant differentiation reduces to ordinary differen- 
tiation. 

It is also easy to calculate the covariant derivative of a tensor. To do this we must deter- 
mine the change in the tensor under an infinitesimal parallel displacement. For example, let 
us consider any contravariant tensor, expressible as a product of two contravariant vectors 
A‘B*. Under parallel displacement, 

ó( A'B*) = A6 B* + B'óA! = — ATE, B'dx" — B'Ti, A'dx". 
By virtue of the linearity of this transformation we must also have, for an arbitrary tensor 
A* 
, 
6A* = —(APT* + AMT) dx. (85.11) 
Substituting this in 
DAY = dA* —6A* = A i dx, 
we get the covariant derivative of the tensor A“ in the form 
oA® 


A® = TATE RA". (85.12) 
Cx 


In completely similar fashion we obtain the covariant derivative of the mixed tensor 4j 
and the covariant tensor A; in the form 


. GA! . . 
iom Iuda PAG (85.13) 
Cx 
0A, 
Aiki = EX — Ut Ams — VitAim- (85.14) 


One can similarly determine the covariant derivative of a tensor of arbitrary rank. In 
doing this one finds the following rule of covariant differentiation. To obtain the covariant 
derivative of the tensor A`: : with respect to x’, we add to the ordinary derivative £A: : /Cx' 
for each covariant index i(A:;:) a term — lA g., and for each contravariant index i(A: ! ^) 
aterm +Tj,A°*:. 

One can easily verify that the covariant derivative of a product is found by the same rule as 
for ordinary differentiation of products. In doing this we must consider the covariant 
derivative of a scalar $ as an ordinary derivative, that is, as the covariant vector $, = &$/Cx*. 
in accordance with the fact that for a scalar ô = 0, and therefore Dé = d$. For example, 
the covariant derivative of the product A; B, is 


(A; Bj); , = Ai; Bit Ai Bko 
If in a covariant derivative we raise the index signifying the differentiation, we obtain the 
so-called contravariant derivative. Thus, 


A;* = g A; Auk = g At. 


We now derive formulas for the transformation of the Christoffel symbols from one 
coordinate system to another. 
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These formulas can be got by comparing the two equations that determine the covariant 
derivatives and requiring that these laws be the same for both. A simple calculation gives 
Ox! Ox" ox? Qu" ox! 
" Qx'" Ax* Ox!  Ox*Ox! ax’™ 
From this formula we see that the [, transforms like a tensor only for linear coordinate 
transformations (when the second term in 85.15 drops out). 

However we note that this term is symmetric in k and /, and therefore drops out for the 
transformation of Si, =T — T}. It therefore transforms like a tensor: 

i gm Ox! Ox" Ox"? 
HOC xn Ax* xt 
and is called the "curvature tensor" of the space. 

We now show that in this theory, based on the equivalence principle, the curvature tensor 
must be zero. In fact, as already stated, by virtue of the equivalence principle there must 
be a "galilean" coordinate system in which the Tj,, and consequently also the S$, vanish 
at a given point. Since Sj, is a tensor, if it vanishes in one coordinate system it must vanish 


in all frames. This means that the Christoffel symbols must be symmetric in their lower 
indices: 


(85.15) 


D =T}, (85.16) 
Clearly, also 

Du I. (85.17) 
In general, there are altogether forty different quantities I7; for each of the four values of 
the index i there are ten different pairs of values of the indices k and / (counting pairs 
obtained by interchanging k and / as the same). 

Formula (85.15) enables us to prove easily the assertion made above that it is always 
possible under condition (85.16) to choose a coordinate system in which all the T% 
become zero at a previously assigned point (such a system is daid to be /ocally-inertial or 
locally -geodesic (see § 87)).t 

In fact, let the given point be chosen as the origin of coordinates, and let the values of the 
Ti, at it be initially (in the coordinates x) equal to (T1))o. In the neighborhood of this point, 
we now make the transformation 


x" = x'+4(Ti)oxtx!. (85.18) 
Then 
Qux" x! 
OT OX ) qp (85.19) 
(sa ôx! T), ( xo 


and according to (85.15), all the T/y become equal to zero. 

We emphasize that condition (85.16) is essential: the expression on the left side of (85.19) 
is symmetric in k and /, and so too must be the right side of the equation. 

We note that for the transformation (85.18) 


ax"\ — 
(2), 7 


t It can also be shown that, by a suitable choice of the coordinate system, one can make all the Ti go 
to zero not just at a point but all along a given world line. (The proof of this statement can be found in the 
book by P. K. Rashevskii, Riemannian Geometry and Tensor Analysis,, Nauka, 1964, $ 91.) 
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so that it does not change the value of any tensor (including the tensor g;,) at the given point, 
so that we can make the Christoffel symbols vanish at the same time as we bring the gj, to 
galilean form. 


$ 86. The relation of the Christoffel symbols to the metric tensor 


Let us show that the covariant derivative of the metric tensor gj, is zero. To do this we 

note that the relation 
DA, = g4DA* 
is valid for the vector DA; as for any vector. On the other hand. 4; = gi A". so that 
DA; = D(gA*) = 9,DA‘+ A'Dgi. 
Comparing with DA, = gj, DA‘, and remembering that the vector A* is arbitrary? 
. . B Dg; = 0. 
Therefore the covariant derivative 
gii = 9. (86.1) 

Thus gj, may be considered as a constant during covariant differentiation. 

The equation gj, , = 0 can be used to express the Christoffel symbols Ij, in terms of the 
metric tensor g. To do this we write in accordance with the general definition (85.14): 


Fix n m _ Iik 
Jik =F F — G mil i — Gim k 774i Thai = 0. 
Cx x 


Thus the derivatives of gą are expressed in terms of the Christoffel symbols.+ We write 
the values of the derivatives of gi, permuting the indices i, k, /: 


Jik 
ax licct ie 
0j 
5 1 = I ut T, us 
Cg T 
a == =l hki Drai 
Cx 
Taking half the sum of these equations, we find (remembering that T; p = D; 4) 
l(Oga Ja Ogu 
Pe dq 36.2 
2b. (4 Ox* ax! (86-2) 
From this we have for the symbols T7, = g"T, x1, 
i Vim (€9mk , Gm Oa 
= im x m + m = : i 
Sia ( ex! Ox* 2) (86.3) 


These formulas give the required expressions for the Christoffel symbols in terms of the 
metric tensor. 

We now derive an expression for the contracted Christoffel symbol Ti; which will be 
important later on. To do this we calculate the differential dg of the determinant g made up 
from the components of the tensor g;,; dg can be obtained by taking the differential of each 
component of the tensor gj, and multiplying it by its coefficient in the determinant, i.e. by 
the corresponding minor. On the other hand, the components of the tensor g* reciprocal 
to gi are equal to the minors of the determinant of the g, divided by the determinant. 


+ Choosing a locally-geodesic system of coordinates therefore means that at the given point all the first 
derivatives of the components of the metric tensor vanish. 
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Therefore the minors of the determinant g are equal to gg". Thus, 
dg = gg"dgx, = —99xdg" (86.4) 
(since gag” = ôi = 4, g* dg, = — gx dg”). 
From (86.3), we have 


i im [EGmk , CGmi Ogri 
ri, =- im me, em 
47359 ( ex | ex* ox" 


“a 


Changing the positions of the indices m and i in the third and first terms in parentheses, we 
see that these two terms cancel each other, so that 


or, according to (86.4), 
© lôg ênv g 
ri =— 2 ONT 86.5 
HO 2g at ex (86.5) 
It is useful to note also the expression for the quantity g“Ti,; we have 
ael ogg a mk Iim C ~ [Jmk | OG 
g"Ti, -e( Imk + 91 - 4) = se ( I mk _ xj 


Ox! xt gx" ox | 2 ax" 


With the help of (86.4) this can be transformed to 


dpi 1 ó(N-gg" 
g'ri = -— 99) (86.6) 
\-9 ex 
For various calculations it is important to remember that the derivatives of the contra- 
variant tensor g* are related to the derivatives of gj, by the relations 


e Ik egi 
Git om = g" (86.7) 
X CX 


(which are obtained by differentiating the equality g,,g'* = 6‘). Finally we point out that the 
derivatives of g^ can also be expressed in terms of the quantities Tjj. Namely, from the 
identity g^. , = 0 it follows directly that 
^ ik 
eg i ni im 
ex! = ~I mg Thug . (86.8) 
With the aid of the formulas which we have obtained we can put the expression for 


A. ; the generalized divergence of a vector in curvilinear coordinates, in convenient form. 
Using (86.5), we have 


0A 6A! Zg 
A, a tria =a MA d 
ex Cx ex 
or, finally, 
l 1 aNg Al 
A’ =- 898 794) (86.9) 
vg ex! 


We can derive an analogous expression for the divergence of an antisymmetric tensor 
A'**. From (85.12), we have 
" cA" ; 
A* — ERI +Ti,A™ +k, A'", 
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But, since A™ = — A*", 
ri,A™ = -Ti A "= 0. 
Substituting the expression (86.5) for T'4,, we obtain 
f | a a A* 
an Qu d (86.10) 
y -g ox* 


Now suppose Aj, is a symmetric tensor; we calculate the expression A% , for its mixed 
components. We have 
ôA; k yl l k 
Ain = ae Hri TA = N 
Ox vV =g Cx 


| UN) 


k —T At. 


The last term here is equal to 


z + : 
ox® ax! Ox! 
Because of the symmetry of the tensor A“, two of the terms in parentheses cancel each other, 
leaving 


-5 e Cy ay 22) A". 


^, k 
Ab, m a= ag) -199u qu. (86.11) 
/-g x 2 Ox 
In cartesian coordinates, 04;/0x* —0A,/0x! is an antisymmetric tensor. In curvilinear 
coordinates this tensor is A;.,—A,.;. However, with the help of the expression for A;,, 
and since T}, = Ti,, we have 
0A; QA, 
ax* ax 
Finally, we transform to curvilinear coordinates the sum 67@/éx,dx' of the second 
derivatives of a scalar $. It is clear that in curvilinear coordinates this sum goes over into 
$i But ¢,; = 6¢/dx', since covariant differentiation of a scalar reduces to ordinary dif- 
ferentiation. Raising the index i, we have 


=g 


Aj, 4 Ags = (86.12) 


ik og 
ox” 
and using formula (86.9), we find 


- 1 6 ~ 406 
$i =r (v=aa" ) (86.13) 
Vg dx! ôx" 
It is important to note that Gauss’ theorem (83.17) for the transformation of the integral 


of a vector over a hypersurface into an integral over a four-volume can, in view of (86.9), be 
written as 


$ AW —g dS; = T A, | —g dQ. (86.14) 


§ 87. Motion of a particle in a gravitational field 


The motion of a free material particle is determined in the special theory of relativity 
from the principle of least action, 


6S = —med [ ds =0, (87.1) 
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according to which the particle moves so that its world line is an extremal between a given 
pair of world points, in our case a straight line (in ordinary three-dimensional space this 
corresponds to uniform rectilinear motion). 

The motion of a particle in a gravitational field is determined by the principle of least 
action in this same form (87.1), since the gravitational field is nothing but a change in the 
metric of space-time, manifesting itself only in a change in the expression for ds in terms of 
the dx’. Thus, in a gravitational field the particle moves so that its world point moves along 
an extremal or, as it is called, a geodesic line in the four-space x°, x', x?, x^; however, since 
in the presence of the gravitational field space-time is not galilean, this line is not a "straight 
line", and the real spatial motion of the particle is neither uniform nor rectilinear. 

Instead of starting once again directly from the principle of least action (sec the problem 
at the end of this section), it is simpler to obtain the equations of motion of a particle in a 
gravitational field by an appropriate generalization of the differential equations for the free 
motion of a particle in the special theory of relativity, i.e. in a galilean four-dimensional 
coordinate system. These equations are du'/ds = 0 or di = 0, where u' = dx'/ds is the four- 
velocity. Clearly, in curvilinear coordinates this equation is generalized to thé equation 

Du! = 0. (87.2) 
From the expression (85.6) for the covariant differential of a vector, we have 
dui 4 Ti u* dx! = 0. 
Dividing this equation by ds, we have 
d? x! - dx* dx! 
ds tuga 0. (87.3) 

This is the required equation of motion. We see that the motion of a particle in a gravita- 
tional field is determined by the quantities I';,. The derivative d?x'/ds? is the four-accelera- 
tion of the particle. Therefore we may call the quantity —mTj,u“u' the "four-force", 
acting on the particle in the gravitational field. Here, the tensor g, plays the role of the 
"potential" of the gravitational field—its derivatives determine the field "intensity" Ij,.t 

In § 85 it was shown that by a suitable choice of the coordinate system one can always 
make all the T}, zero at an arbitrary point of space-time. We now see that the choice of such 
a locally-inertial system of reference means the elimination of the gravitational field in the 
given infinitesimal element of space-time, and the possibility of making such a choice is an 
expression of the principle of equivalence in the relativistic theory of gravitation. } 

As before, we define the four-momentum of a particle in a gravitational field as 

pi = meui. (87.4) 
Its square is 
pip — mic". (87.5) 


+ We also give the form of the equations of motion expressed in terms of covariant components of the 
four-acceleration. From the condition Du, = 0, we find 

du 

Ty uu! = O. 


ds 
Substituting for I. ,, from (86.2), two of the terms cancel and we are Icft with 


+ In the footnote on p. 241 we also noted the possibility of choosing a reference system which is “inertial 
along a given world line." In particular. if this line is the time axis (along which x‘, x?, x? = const), then 
the gravitational field will be eliminated for all times in the given spatial element. 
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Substituting —@S/éx' for pj, we find the Hamilton-Jacobi equation for a particle in a 
gravitational field: 


— mic =0. (87.6) 


The equation of a geodesic in the form (87.3) is not applicable to the propagation of a 
light signal, since along the world line of the propagation of a light ray the interval ds, as 
we know, is zero, so that all the terms in equation (87.3) become infinite. To get the equa- 
tions of motion in the form needed for this case, we use the fact that the direction of pro- 
pagation of a light ray in geometrical optics is determined by the wave vector tangent to the 
ray. We can therefore write the four-dimensional wave vector in the form k’ = dx'/d à, 
where À is some parameter varying along the ray. In the special theory of relativity, in the 
propagation of light in vacuum the wave vector does not vary along the path, that is, 
dk‘ = 0 (see 8 53). In a gravitational field this equation clearly goes over into Dk’ = 0 or 

Z HDuk'k = 0 (87.7) 
(these equations also determine the parameter å). t 
The absolute square of the wave four-vector (see § 48) is zero, that is, 


kk! =0. (87.8) 


Substituting éw/éx' in place of k; (y is the eikonal), we find the eikonal equation in a gravita- 
tional field ` 
a OW OW 


Ox ôx" — 
In the limiting case of small velocities, the relativistic equations of motion of a particle 
in a gravitational field must go over into the corresponding non-relativistic equations. In 
this we must keep in mind that the assumption of small velocity implies the requirement that 
the gravitational field itself be weak; if this were not so a particle located in it would acquire 
a high velocity. 
Let us examine how, in this limiting case, the metric tensor g,, determining the field is 
related to the nonrelativistic potential $ of the gravitational field. 
In nonrelativistic mechanics the motion of a particle in a gravitational field is determined 


by the Lagrangian (81.1). We now write it in the form 
2 


L- — me? + — mó, (87.10) 


(87.9) 


adding the constant — mc?.? This must be done so that the nonrelativistic Lagrangian in the 

absence of the field, L = — mc? +mv?/2, shall be the same exactly as that to which the 

corresponding relativistic function L = —mc?^V1— v?/c? reduces in the limit as v/c — 0. 
Consequently, the nonrelativistic action function S for a particle in a gravitational field 


has the form 
2 
S= f ra= -me f(e- 7 + ©) a 
2c c 


t Geodesics, along which ds Z0, are said to be null or isotropic. 

+ The potential ¢ is, of course, defined only to within an arbitrary additive constant. We assume through- 
out that one makes the natural choice of this constant so that the potential vanishes far from the bodies 
producing the field. 
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Comparing this with the expression S = —mc f ds, we see that in the limiting case under 
consideration 
2 
v 
ds = («- = e dt. 
2c c 


Squaring and dropping terms which vanish for c > oo, we find 
ds? = (c?-20) d? — dr?. (87.11) 


where we have used the fact that v dt = dr. 
Thus in the limiting case the component goo of the metric tensor is 


2 
Joo => 1+ ;4 (87.12) 


As for the other components, from (87.11) it would follow that gap = dep, go, = O. 
Actually, however, the corrections to them are, generally speaking, of the same order of 
magnitude as the corrections to go, (for more detail, see $ 106). The impossibility of deter- 
mining these corrections by the method given above is related to the fact that the corrections 
to the g,g, though of the same order of magnitude as the correction to goo, would give rise 
to terms in the Lagrangian of a higher order of smallness (because in the expression for ds? 
the components gg are not multiplied by c?, while this is the case for goo). 


PROBLEM 


Derive the equation of motion (87.3) from the principle of least action (87.1). 
Solution: We have: 


" 
ds? = 2ds bds = (gu, dx! dx*) = dx! dx* 2 8214-29, dx! dôx*. 


Therefore 


» | dx! dx" agin. dx! dóx* 
LER Í i wd oe OE as 


1 dx! dx" gix d dx! 
=— ZR n iR NS kem rz LV béx*5 dy 
ME Í {5 ds ds Ox à ds V ds pe 
(in integrating by parts, we use the fact that óx* — O at the limits). In the second term in the integral, 


we replace the index k by the index /. We then find, by equating to zero the coefficient of the arbitrary 
variation óx!: 


l (2g d "RET du tye 29 
og ae d wea or og ee sue 9. 


Noting that the third term can be written as 


and introducing the Christoffel symbols T, y, in accordance with (86.2), we have: 


du' 
gu "dis TEL a ul u* = 0. 


Equation (87.3) is obtained from this by raising the index /. 
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§ 88. The constant gravitational field 


A gravitational field is said to be constant if one can choose a system of reference in which 
all the components of the metric tensor are independent of the time coordinate x°; the 
latter is then called the world time. 

The choice of a world time is not completely unique. Thus, if we add to x° an arbitrary 
function of the space coordinates, the gą will still not contain x°; this transformation 
corresponds to the arbitrariness in the choice of the time origin at each point in space.t 
In addition, of course, the world time can be multiplied by an arbitrary constant, i.e. the 
units for measuring it are arbitrary. 

Strictly speaking, only the field produced by a single body can be constant. In a system of 
several bodies, their mutual gravitational attraction will give rise to motion, as a result of 
which the field produced by them cannot be constant. 

If the body producing the field is fixed (in the reference system in which the gj, do not 
depend on x?), then both directions of time are equivalent. For a suitable choice of the time 
origin at all the points in space, the interval ds should in this case not be changed when we 
change the sign of x°, and therefore all the components go, of the metric tensor must be 
identically equal to zero. Such constant gravitational fields are said to be sratic. 

However, for the field produced by a body to be constant, it is not necessary for the body 
to be at rest. Thus the field of an axially symmetric body rotating uniformly about its axis 
will also be constant. However in this case the two time directions are no longer equivalent 
by any means—if the sign of the time is changed, the sign of the angular velocity is changed. 
Therefore in such constant gravitational fields (we shall call them stationary fields) the 
components go, of the metric tensor are in general different from zero. 

The meaning of the world time in a constant gravitational field is that an interval of world 
time between events at a certain point in space coincides with the interval of world time 
between any other two events at any other point in space, if these events are respectively 
simultaneous (in the sense explained in § 84) with the first pair of events. But to the same 
interval of world time x? there correspond, at different points of space, different intervals 
of proper time T. 

The relation between world time and proper time, formula (84.1), can now be written in 
the form 


1 
pu go X, (88.1) 


applicable to any finite time interval. 
If the gravitational field is weak, then we may use the approximate expression (87.12), 


and (88.1) gives 
[0 
pec (i + £) (88.2) 
c Cc 


1 It is easy to see that under such a transformation the spatial metric, as expected, does not change. In 
fact, under the substitution 
x? — x? 4 f(x!, x?, x3) 
with an arbitrary function f(x', x?, x°), the components gix change to 
«8 > Gas + Goo. «f. a T Goal a Gosh. a. 
Joa > Joa + Goof, as Joo — Joo. 
where f.a = @f/éx*. This obviously does not change the tensor (84.7). 
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Thus proper time elapses the more slowly the smaller the gravitational potential at a given 
point in space. i.e., the larger its absolute value (later, in § 96, it will be shown that the poten- 
tial $ is negative). If one of two identical clocks is placed in a gravitational field for some 
time, the clock which has been in the field will thereafter appear to be slow. 

As was already indicated above, in a static gravitational field the components gp, of the 
metric tensor are zero. According to the results of $ 84, this means that in such a field 
synchronization of clocks is possible over all space. We note also that the element of spatial 
distance in a static field is simply: 


dP = —g,, dx" dx. (88.3) 


In a stationary field the go, are different from zero and the synchronization of clocks 
over all space is impossible. Since the g;, do not depend on x°, formula (84.14) for the dif- 
ference between the values of world time for two simultaneous events occurring at different 
points in space can be written in the form 


dx* 

Ax? = — fac (88.4) 
Joo 

for any two points on the line along which the synchronization of clocks is carried out. In 

the synchronization of clocks along a closed contour, the difference in the value of the world 

time which would be recorded upon returning to the starting point is equal to the integral 


Ax? = — p (88.5) 
Joo 

taken along the closed contour.t 

Let us consider the propagation of a light ray in a constant gravitational field. We have 
seen in § 53 that the frequency of the light is the time derivative of the eikonal y (with 
opposite sign). The frequency expressed in terms of the world time x°/c is therefore 
Wo = —c(Oy//0x?). Since the eikonal equation (87.9) in a constant field does not contain x? 
explicitly, the frequency c remains constant during the propagation of the light ray. The 
frequency measured in terms of the proper time is œ = — (0/01); this frequency is different 
at different points of space. 

From the relation 


we have 
os -—— (88.6) 
v Goo 
In a weak gravitational field we obtain from this, approximately, 
= Wo ( — $) (88.7) 


We see that the light frequency increases with increasing absolute value of the potential of 
the gravitational field, i.e. as we approach the bodies producing the field; conversely, as the 
light recedes from these bodies the frequency decreases. If a ray of light, emitted at a point 


t The integral (88.5) is identically zero if the sum £o, dx?/gso is an exact differential of some function of 
the space coordinates. However, such a case would simply mean that we are actually dealing with a static 
field, and that all the gẹ could be made equal to zero by a transformation of the form x*— x° f (x°). 
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where the gravitational potential is $,, has (at that point) the frequency œw, then upon 
arriving at a point where the potential is $;, it will have a frequency (measured in units of 
the proper time at that point) equal to 


a (oS) oett) 
[4 


A line spectrum emitted by some atoms located, for example, on the sun, looks the same 
there as the spectrum emitted by the same atoms located on the earth would appear on it. 
If, however, we observe on the earth the spectrum emitted by the atoms located on the sun, 
then, as follows from what has been said above, its lines appear to be shifted with respect 
to the lines of the same spectrum emitted on the earth. Namely, each line with frequency w 
will be shifted through the interval Ao given by the formula 

Ao = eoe W, (88.8) 
where ¢, and @, are the potentials of the gravitational field at the points of emission and 
Observation of the spectrum respectively. If we observe on the earth a spectrum emitted on 
the sun or the stars, then |$,| > |$;], and from (88.8) it follows that Aw < 0, i.e. the shift 
occurs in the direction of lower frequency. The phenomenon we have described is called the 
“red shift". 

The occurrence of this phenomenon can be explained directly on the basis of what has 
been said above about world time. Because the field is constant, the interval of world time 
during which a certain vibration in the light wave propagates from one given point of space 
to another is independent of x°. Therefore it is clear that the number of vibrations occurring 
in a unit interval of world time will be the same at all points along the ray. But to one and 
the same interval of world time there corresponds a larger and larger interval of proper time, 
the further away we are from the bodies producing the field. Consequently, the frequency, 
i.e. the number of vibrations per unit proper time, will decrease as the light recedes from these 
masses. 

During the motion of a particle in a constant field, its energy, defined as 


os 


TE ax?! 


the derivative of the action with respect to the world time, is conserved; this follows, for 
example, from the fact that x? does not appear explicitly in the Hamilton-Jacobi equation. 
The energy defined in this way is the time component of the covariant four-vector of 
momentum p, = mcu, = mcg,;w. In a static field, ds? = goo(dx°)? — dl?, and we have for 
the energy, which we here denote by £o, 


dx? dx? 
o = mc?ggg — = MC? Joo ———————. 
o = mMc goo ds mc "00 Joo dx)? — dii 
We introduce the velocity 
m dl |. cdl 
dt V goo dx 


of the particle, measured in terms of the proper time, that is, by an observer located at the 
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given point. Then we obtain for the energy 


— 


mc goo 


Eo = — eee, (88.9) 


This is the quantity which is conserved during the motion of the particle. 

It is easy to show that the expression (88.9) remains valid also for a stationary field. if 
only the velocity v is measured in terms of the proper time, as determined by clocks syn- 
chronized along the trajectory of the particle. If the particle departs from point A at the 
moment of world time x? and arrives at the infinitesimally distant point B at the moment 
x°+dx°, then to determine the velocity we must now take, not the time interval 
(x°+dx°)—x° — dx?, but rather the difference between x°+dx° and the moment 
x9 —(GoalGoo)dx* which is simultaneous at the point B with the moment x? at the point 4: 


(x°+dx°)+ (2- Soa x) = dx? + 295 qus 
goo Joo 
Multiplying by Goole, we obtain the corresponding interval of proper time, so that the 
velocity is 


pa 0. (88.10) 
Vh(dx® — g, dx?) 


where we have introduced the notation 


g 
h-go.  4,27— —— (88.11) 
Joo 
for the three-dimensional vector g (which was already mentioned in § 84) and for the three- 
dimensional scalar gop. The covariant components of the velocity v form a three-dimen- 
sional vector in the space with metric y,,, and correspondingly the square of this vector is 
to be taken ast 
= Jt? vv? = vv. (88.12) 
We note that with such a definition, the interval ds is expressed in terms of the velocity in 
the usual fashion: 


ds? = goo (dx°)?  2go, dx? dx* + gan dx* dx? 
= h(dx?—g, dx*)? — dl? 


2 
= h(dx* — g, dx’)? (1 - 2) (88.13) 
The components of the four-velocity 
u' = dx’ 
~ ds 


+ In our further work we shall repeatedly introduce, in addition to four-vectors and four-tensors, three- 
dimensional vectors and tensors defined in the space with metric yas; in particular the vectors g and v, which 
we have already used, are of this type. Just as in four dimensions the tensor operations (in particular, raising 
and lowering of indices) are done using the metric tensor gix, so, in three dimensions these are done using the 
tensor Yas. To avoid misunderstandings that may arise, we shall denote three-dimensional quantities by 
symbols other than those used for four-dimensional quantities. 
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are 


v= —— u? = — E AT (88.14) 


c Gi c 
The energy is 
Ey = mc?gg;u! = me h(u? — g,u*), 
and after substituting (88.14), takes the form (88.9). 


In the limiting case of a weak gravitational field and low velocities, by substituting 
Joo = 1+(2¢/c?) in (88.9), we get approximately: 


2 
o = me? + = 4 mó, (88.15) 


where ni is the potential energy of the particle in the gravitational field, which is in agree- 
ment with the Lagrangian (87.10). 


PROBLEMS 


I. Determine the force acting on a particle in a constant gravitational field. 
Solution: For the components of F, which we need, we find the following expressions: 


] 
5-5 si) 5 90h" , 0 


a A l 
Ts, = 45; + 5 [IIF — 95) + 9 (gj! — 9:3) +5 9,9, ^". 


In these expressions all the tensor operations (covariant differentiation, raising and lowering of 
indices) are carried out in the three-dimensional space with metric y,5, on the three-dimensional 
vector g^ and the three-dimensional scalar h (88.11); 4%, is the three-dimensional Christoffel 
symbol, constructed from the components of the tensor 7,; in just the same way as T}, is con- 
structed from the components of gix; in the computations we use (84.9-12). 

Substituting (1) in the equation of motion 


du* 
ds 


and using the expression (88.14) for the components of the four-velocity, we find after some simple 
transformations: 


d v hie V/A(g:,—gg ^ Ay wD 


ds m = v p p (2) 
c Hi 2 (1-2) (1-3) (1-3) 


The force f acting on the particle is the derivative of its momentum p with respect to the (syn- 
chronized) proper time, as defined by the three-dimensional covariant differential: 


v? D, f, | v d m” „a Dmv^v* 
c ds = zu ^N c? ds J y2 m x 


c? c? 


= —De(u*y — 2Psgu^u? — T5 uPu* 
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From (2) we therefore have (for convenience we lower the index «): 


or, in the usual three-dimensional notation,t 


mc? 


f= TT b In Vha Vh : (curl Ji (3) 
Neer 


We note that if the body is at rest, then the force acting on it [the first term in (3)] has a potential. 


For low velocities of motion the second term in (3) has the form mc V hv» (curl g) analogous to 
the Coriolis force which would appear (in the absence of the field) in a coordinate system rotating 
with angular velocity 


Q= 5 Vicurl g. 


2, Derive Fermat's principle for the propagation of a ray in a constant gravitational field. 
Solution: Fermat's principle (§ 53) states: 


ô f K, dx* =Q, 
where the integral is taken along the ray, and the integral must be expressed in terms of the frequency 
(9 (which is constant along the ray) and the coordinate differentials. Noting that ko = —ey/¢x° = 


(o/c), we write: 
Mo 


voh5- goi $! = goo kK? goa k* = h(k°—gak*). 


Substituting this in the relation A; A‘ = gix k'k" = 0, written in the form 
h(k? —g. k*y? — yap ktk? = — 


T In three-dimensional curvilinear coordinates, the unit antisymmetric tensor is defined as 
] 


—- er’, 

Vy 
where e,2; = e!?? = |, and the sign changes under transposition of indices [compare (83.13-14)]. Accord- 
ingly the vector c =a xb, defined as the vector dual to the antisymmetric tensor Cay = ag b,—a, bs, has 
components: 


lagy = vy agys n” = 


- 7 , 1 ; ] , 
€, = AVY eg, c? = V v Cay a bt, C mL e Pres, = — e?" by. 
2 y 
Conversely, 
- 1 
Cag = Vy Canc’, C“ = —_ Cy, 
vy 


In particular, curl a should be understood in this same sense as the vector dual to the tensor 
05; 4 — 04; s = (Cag/Ox*) — (6a,/0x^), so that its contravariant components are 


Ó [7 
(curl a)* = L- er [ 29». _ pal . 
2vVy 
In this same connection we repeat that for the three-dimensional divergence of a vector [sec (86.9)]: 


1 a 
diva = Sa ga a. 


To avoid misunderstandings when comparing with formulas frequentiy used for the three-dimensional 
vector operations in orthogonal curvilinear coordinates (see, for example, Electrodynamics of Continuous 
Media, appendix), _ we point out that in these formulas the components of the vectors are understood to be 


the quantities Vg; A'(= V 4, A1), V gaz A?, V ga A’. 
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2 
1 (2) —yas kek! — 0. 
[4 


we obtain: 


h 


Noting that the vector k* must have the direction of the vector dx“, we then find: 


where d! (84.6) is the element of spatial distance along the ray. In order to obtain the expression for 
Ka, We write 


[o 
ke — gk, = g' ko tgka = —g* - — y*^k,, 


Oo Oo f Yas dx? 
ka = — Ja k! -+ — £ j| = — -- —— d a]. 
(e) OCG ar) 


Finally, multiplying by dv", we obtain Fermat's principle in the form (dropping the constant factor 
wole): 


so that 


In a static field, we have simply: 


We call attention to the fact that in a gravitational field the ray does not propagate along the 
shortest line in space, since the latter would be defined by the equation 6 f dl = O. 


§ 89. Rotation 


As a special case of a stationary gravitational field, let us consider a uniformly rotating 
reference system. To calculate the interval ds we carry out the transformation from a system 
at rest (inertial system) to the uniformly rotating one. In the coordinates r’, œ’, 2’, t of the 
system at rest (we use cylindrical coordinates r', ¢’, z^), the interval has the form 


ds? 2c? dÜ-dr?^—r? d$? —dz". (89.1) 


Let the cylindrical coordinates in the rotating system be r, $, z. If the axis of rotation 
coincides with the axes Z and Z’, then we have r' =r, z =z, $' = $ Or, where Q is the 
angular velocity of rotation. Substituting in (89.1), we find the required expression for ds? 
in the rotating system of reference: 


ds? = (c? —Q?r?) d? 20r? d$ dt - dz? —? d$? — dr?. (89.2) 


It is necessary to note that the rotating system of reference can be used only out to distances 
equal to c/Q. In fact, from (89.2) we see that for r > c/Q, goo becomes negative, which is 
not admissible. The inapplicability of the rotating reference system at large distances is 
related to the fact that there the velocity would become greater than the velocity of light, 
and therefore such a system cannot be made up from real bodies. 

As in every stationary field, clocks on the rotating body cannot be uniquely synchronized 
at all points. Proceeding with the synchronization along any closed curve, we find, upon 
returning to the starting point, a time differing from the initial value by an amount [see 
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(88.5)] 
1 f go, l Qr?dó 
= — ~- — d = — — 
2 c $2 iuh Q2)? 
l- S 


or, assuming that Qr/c < | (i.e. that the velocity of the rotation is small compared with the 
velocity of light), 


Q 20 
aes i rdp = £^, (89.3) 


where S is the projected area of the contour on a plane perpendicular to the axis of rotation 
(the sign -- or — holding according as we traverse the contour in, or opposite to, the direc- 
tion of rotation). 

Let us assume that a ray of light propagates along a certain closed contour. Let us cal- 
culate to terms of order v/c the time t that elapses between the starting out of the light ray 
and its return to the initial point. The velocity of light, by definition, is always equal to c, 
if the times are synchronized along the given closed curve and if at each point we use the 
proper time. Since the difference between proper and world time is of order v?/c?, then 
in calculating the required time interval t to terms of order v/c this difference can be neglected. 
Therefore we have 

t= > z - S, 
c^ c 
where L is the length of the contour. Corresponding to this, the velocity of light, measured 
as the ratio L/t, appears equal to 


S 
+ = 89.4 
ct2Q T (89.4) 


This formula, like the first approximation for the Doppler effect, can also be easily derived 
in a purely classical manner. 


PROBLEM 
Calculate the element of spatial distance in a rotating coordinate system. 
Solution: With the help of (84.6) and (84.7), we find 


2342 
dl? = dr* dg 4 9 


r 
1-83 


which determines the spatial geometry in the rotating reference system. We note that the ratio of 
the circumference of a circle in the plane z = constant (with center on the axis of rotation) to its 


radius r is 
; 2n] V1 — Q?r?je?, 


i.e. larger than 27. 


§ 90. The equations of electrodynamics in the presence of a gravitational field 


The electromagnetic field equations of the special theory of relativity can be easily 
generalized so that they are applicable in an arbitrary four-dimensional curvilinear system 
of coordinates, i.e., in the presence of a gravitational field. 
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The electromagnetic field tensor in the special theory of relativity is. defined as 
Fy, = (04,/0x') -C(0A;/0x^). Clearly it must now be defined correspondingly as 
Fik = A,,;— A;,. But because of (86.12), 


Fu = Agim Ant = E - m (90.1) 


and therefore the relation of F;, to the potential A; does not change. Consequently the first 
pair of Maxwell equations (26.5) also does not change its formt 


oF Mu SPa g; (90.2) 
ex’ €x Cx 
In order to write the second pair of Maxwell equations, we must first determine the current 
four-vector in curvilinear coordinates. We do this in a fashion completely analogous to that 
which we followed in § 28. The spatial volume element, constructed on the space coordinate 
elements dx', dx?, and dx?, is Vy dV, where y is the determinant of the spatial metric 
tensor (84.7) and dV = dx! dx? dx? (see the tootin, on p. 232). We introduce the charge 
density o according to the definition de = ey l dV, where de is the charge located within 
the volume element Vy dV. Multiplying this equation on both sides by dx‘, we have: 
dx i 
dx 7.0 


de dx! = g dx! y dx! dx? dx? = ey -g dà —5 


\ Goo 
[where we have used the formula —g = 7 goo (84.10)]. The product = gdQ is the in- 
variant element of four-volume, so that the current four-vector is defined by the expression 


" gc dx! 
jJ = ELI (90.3) 
V Joo dx 
(the quantities dx'/dx? are the rates of change of the coordinates with the "time" x°., and 


do not constitute a four-vector). The component j° of the current four-vector, iG pd 
by V Goole, is the spatial density of charge. 

For point charges the density g is expressed as a sum of ó-functions, as in formula (28.1). 
We must, however, correct the definition of these functions for the case of curvilinear co- 
ordinates. By ó(r) we shall again mean He ce ó(x') ó(x^) 5(x3), regardless of the 
geometrical meaning of the coordinates x', x?, x3; then the integral over dV (and not over 
Jy dV) is unity: f ó(r) dV = 1. With this same E debien of the ó-functions, the charge 
density is 


and the current four-vector is 


lx! 
Dr = = He iom z (90.4) 


Conservation of charge is éznresied s A equation of continuity, which differs from 


t It is easily seen that the equation can also be written in the form 


Fixi Fior Fania = O, 
from which its covariance is obvious. 
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(29.4) only in replacement of the ordinary derivatives by covariant derivatives: 
à — 
Ja === Taco (90.5) 
vV —g Ox’ 
[using formula (86.9)]. 
The second pair of Maxwell equations (30.2) is generalized similarly; replacing the 
ordinary derivatives by covariant derivatives, we find: 
Phe t (JG Py = Sj (90.6) 
V —g Ox c 
[using formula (86.10)]. 
Finally the equations of motion of a charged particle in gravitational and electromagnetic 
fields is obtained by replacing the four-acceleration du'/ds in (23.4) by Du'/ds: 


Dui lui — ; 
me—— = mc (E I) - Í Phu, (90.7) 
PROBLEM 


Write the Maxwell equations in a given gravitational field in three-dimensional form (in the 
three-dimensional space with metric 7.5), introducing the three-vectors E, D and the antisymmetric 
three-tensors B,; and H..; according to the definitions: 


Ey = Foa, Bas = Fan, 
D: = — V goo Fe, H“ = V go F”. (1) 
Solution: The quantities introduced above are not independent. Writing out the equations 


Fou = goifam F", F = g“'g™ Fim, 


and introducing the three-dimensional metric tensor 7,5 = — guas +-hgegs [with g and A from (88.11)], 
and using formulas (84.9) and 84.12), we get: 
E, Hs 
D,= -= +-9"Has, Be=—_ +g E*—g'E®, 2 
Vg oh "RE g (2) 
We introduce the vectors B and H, dual to the tensors B,; and Has, in accordance with the definition: 
I D^ 
B° = Cate, H. = -3Y? ean H” (3) 


(see the footnote on p. 252; the minus sign is introduced so that in galilean coordinates the vector: 
H and B coincide with the ordinary magnetic field intensity). Then (2) can be written in the forms 


E H 
D = — rHx , B= —- + xE. 4 
y, H*g v; 8 (4) 


Introducing definition (1) in (90.2), we get the equations: 


Gv + Gye dx =O 
@Bes | 0E, GE, _ 
ox? ex^ — Ox* , 
or, changing to the dual quantities (3): 
1 e 


dvB-0, curl E= -ZEF (Vy B) (5) 
V? 
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(x° = ct; the definitions of the operations div and curl are given in the footnote on p. 252). Similarly 
we find from (90.6) the equations 


] 8 z 
vs ae = 4nQ, 


1 E 
3 (vy H+ d. 2 v3 D*) = —4ne — 
V7 ax? v ax? dx? 
or, in three-dimensional notation: 
1 2 - 4n 
div D = 479, curl H = —- — (Vy D)+ — s, 6 
$5 a yD)+ 2 (6) 


where s is the vector with components s* = ọ dx*/dt. 
We also write the continuity equation (90.5) in three-dimensional form: 


1 e - ; 
— — (Vy Q)4-divs = 0. (7) 
Vy et 

The reader should note the analogy (purely formal, of course) of equations (5) and (6) to the 
Maxwell equations for the electromagnetic field in material media. In particular, in a static gravita- 
tional field the quantity Vy drops out of the terms containing time derivatives, and relation (4) 


reduces to D = E/ V h, B = H/ Vh. We may say that with respect to its effect on the electromagnetic 
field a static gravitational field plays the role of a medium with electric and magnetic permeabilities 


e=p=i/Vh, 


CHAPTER 11 


THE GRAVITATIONAL FIELD EQUATIONS 


§ 9]. The curvature tensor 


Let us go back once more to the concept of parallel displacement of a vector. As we said 
in § 85, in the general case of a curved four-space, the infinitesimal parallel displacement of a 
vector is defined as a displacement in which the components of the vector are not changed in 
a system of coordinates which is galilean in the given infinitesimal volume element. 

If x! = x'(s) is the parametric equation of a certain curve (s is the arc length measured 
from some point), then the vector uí = dx'/ds is a unit vector tangent to the curve. If the 
curve we are considering is a geodesic, then along it Du' = 0. This means that if the vector 
u? is subjected to a parallel displacement from a point x‘ on a geodesic curve to the point 
x'+dx' on the same curve, then it coincides with the vector u' + du! tangent to the curve at 
the point x'+dx'. Thus when the tangent to a geodesic moves along the curve, it is displaced 
parallel to itself. 

On the other hand, during the parallel displacement of two vectors, the "angle" between 
them clearly remains unchanged. Therefore we may say that during the parallel displace- 
ment of any vector along a geodesic curve, the angle between the vector and the tangent 
to the geodesic remains unchanged. In other words, during the parallel displacement of a 
vector. its component along the geodesic must be the same at all points of the path. 

Now the very important result appears that in a curved space the parallel displacement of 
a vector from one given point to another gives different results if the displacement is carried 
out over different paths. In particular, it follows from this that if we displace a vector parallel 
to itself along some closed contour, then upon returning to the starting point, it will not 
coincide with its original value. 

In order to make this clear, let us consider a curved two-dimensional space, i.e., any 
curved surface. Figure 19 shows a portion of such a surface, bounded by three geodesic 
curves. Let us subject the vector I to a parallel displacement along the contour made up of 


Fic. 19. 
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these three curves. In moving along the line AB, the vector 1, always retaining its angle with 
the curve unchanged, goes over into the vector 2. In the same way, on moving along BC 
it goes over into 3. Finally, on moving from C to A along the curve CA, maintaining a 
constant angle with this curve, the vector under consideration goes over into 1’, not co- 
inciding with the vector 1. 

We derive the general formula for the change in a vector after parallel displacement 
around any infinitesimal closed contour. This change AA, can clearly be written in the form 
$ 5A,, where the integral is taken over the given contour. Substituting in place of 5A, the 
expression (85.5), we have 


AA, = $ Ti A dx! (91.0) 


(the vector 4; which appears in the integrand changes as we move along the contour). 

For the further transformation of this integral, we must note the following. The values of 
the vector A; at points inside the contour are not unique; they depend on the path along 
which we approach the particular point. However, as we shall see from the result obtained 
below, this non-uniqueness is related to terms of second order. We may therefore, with the 
first-order accuracy which is sufficient for the transformation, regard the components of the 
vector A; at points inside the infinitesimal contour as being uniquely determined by their 
values on the contour itself by the formulas 6A; = I7, 4,dx!, i.e., by the derivatives 

“ = IA, (91.2) 

Now applying Stokes’ theorem (6.19) to the integral (91.1) and considering that the area 

enclosed by the contour has the infinitesimal value Af'", we get: 


ML [e - UT arin 
2 Ox Ox 
1 rori eri , OA; _, 8A; 
= - |2 A — — A. +Ti, — -Ti — | arin 
2 | ôx! i ôx" it km Ay! At sal f 
Substituting the values of the derivatives (91.2), we get finally: 
AA, = dim A; Af", (91.3) 
where R/,, is a tensor of the fourth rank: 
Ra, = im LTD. pope lp Fiy (91.4) 
Óx ox" 


That R4, is a tensor is clear from the fact that in (91.3) the left side is a vector — the dif- 
ference AA, between the values of vectors at one and the same point. The tensor R4, is 
called the curvature tensor or the Riemann tensor. 

It is easy to obtain a similar formula for a contravariant vector A‘. To do this we note, 
since under parallel displacement a scalar does not change, that A(A4*B,) = 0, where B, is 
any covariant vector. With the help of (91.3), we then have 


14 B,R'uLAf "+ BAA‘ = 
= BAA + M'R'iLAf'") = 0, 


A(A‘B,) = A'AB, + B,AA* 


or, in view of the arbitrariness of the vector B,, 
AA‘ = — 4R mA Af” (91.5) 
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If we twice differentiate a vector A; covariantly with respect to x* and x, then the result 
generally depends on the order of differentiation, contrary to the situation for ordinary 
differentiation. It turns out that the difference A; ,; , — Aj, ;,, is given by the same curvature 
tensor which we introduced above. Namely, one finds the formula 


Aika nj Aitk ET AR” iets (91.6) 


which is easily verified by direct calculation in the locally-geodesic coordinate system 
Similarly, for a contravariant vector,t 


A ya D A ace rad A" Ra. (91.7) 


Finally, it is easy to obtain similar formulas for the second derivatives of tensors fthis is 
done most easily by considering, for example, a tensor of the form A; B,, and using formulas 
(91.6) and (91.7); because of the linearity, the formulas thus obtained must be valid for an 
arbitrary tensor Aj]. Thus 


Aim — Ama = AinR kim + AnkR iim (91.8) 


Clearly, in a flat space the curvature tensor is zero, for, in a flat space, we can choose 
coordinates such that over all the space all the T}, = 0, and therefore also R’,,,, = 0. Because 
of the tensor character of R’,,,, it is then equal to zero also in any other coordinate system. 
This is related to the fact that in a flat space parallel displacement isa single-valued operation, 
so that in making a circuit of a closed contour a vector does not change. 

The converse theorem is also valid: if R',),, = 0, then the space is flat. Namely, in any 
space we can choose a coordinate system which is galilean over a given infinitesimal region. 
If Riim = 0, then parallel displacement is a unique operation, and then by a parallel 
displacement of the galilean system from the given infinitesimal region to all the rest of the 
Space, we can construct a galilean system over the whole space, which proves that the space 
is Euclidean. 

Thus the vanishing or nonvanishing of the curvature tensor is a criterion which enables 
us to determine whether a space is flat or curved. 

We note that although in a curved space we can also choose a coordinate system which 
will be locally geodesic at a given point, at the same time the curvature tensor at this same 
point does not go to zero (since the derivatives of the T$, do not become zero along with 
the Fy). 


PROBLEMS 


1. Determine the relative four-acceleration of two particles moving along infinitely close geodesic 
world lines. 

Solution: Consider a family of geodesics differing in the value of some parameter v; in other 
words. the coordinates of the world point are expressed as functions x' = x’ (s, v), so that for each 
v -= const, this is the equation of a geodesic (where s is the length of interval measured along the 
line from its point of intersection with some given hypersurface). We introduce the four-vector 


ud 
ex 

qoc Q Ove, 
Cv 


t Formula (91.7) can also be obtained directly from (91.6) by raising the index / and using the symmetry 
properties of the tensor AR, ($ 92). 
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joining points on intinitely close geodesics (corresponding to parameter values v and v+dv) that 
have the same value of s. 

From the definition of a covariant derivative and the equality @u‘/@v = Qv'/Os (where u' = 
= Óx'jÓs), it follows that 


wav" = vu. (1) 
Now consider the second derivative: 
D?v' i . 
du E(u pl m Qu) ma ut lh eau un 


We use (1) again in the second term, and change the order of covariant differentiation in the first 
term using (91.7), and find: 
D'v = (uuu). +u" Ryu 
ds? S Hi mkt . 


The first term is zero, since u'u’ = 0 along geodesics. Introducing the constant factor ôv, we get 
the final equation: 


D? yi 
ds? 


which is called the geodesic deviation. 


= R'uuh* wn", (2) 


2. Write the Maxwell equations in the absence of charges for the 4-potential in the Lorentz gauge. 
Solution: The covariant generalization of the condition (46.9) has the form: 


A1, — 0. (1) 
Using formula (91.7) the Maxwell equations can be written as 
Fet = Agi — Az = Agh e AT RS Ari = 0 
with R, from (92.6). Then from Eq. (1): 
Azé — Ry A‘ — 0. (2) 


$ 92. Properties of the curvature tensor 


The curvature tensor has symmetry properties which can be made completely apparent 
by changing from mixed components R',,, to covariant ones: 


Rua = GinRitm 
By means of simple transformations it is easy to obtain the following expression: 


1/2gim , Og, Ogu 0g, 
=f —m EM Im Te uy" re 
iklm 2 Ge éx!@x™ ox*ox" 55) t 2, (UR Tin kml 5 . (92.1) 
From this expression one sees immediately the following symmetry properties: 
Rigim = ~ Ryu = — Rui (92.2) 
Ririm = Rimiks (92.3) 


i.e. the tensor is antisymmetric in each of the index pairs i, k and /, m, and is symmetric 
under the interchange of the two pairs with one another. In particular, all components 
Risim, in which i = k or / = m, are zero. 
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One also verifies that the cyclic sum of components of Rj,,, formed by permutation of 
any three indices, is equal to zero; e.g., 


Rikim + Rit Russ = 0. (92.4) 
[The other relations of this type are obtained from (92.4) automatically, because of the 
properties (92.2-3).] 
Finally, we also prove the Bianchi identity: 
R” Km F R” imka + R” imk =0. (92.5) 
It is most conveniently verified by using a locally-geodesic coordinate system. Because of 
its tensor character, the relation (92.5) will then be valid in any other system. Differentiating 
(91.4) and then substituting in it I';; = 0, we find for the point under consideration 
OR | OS _ OF, 
óx" — Ox"óx*  Ox"Ox" 
With the aid of this expression it is easy to verify that (92.5) actually holds. 
From the curvature tensor we can, by contraction, construct a tensor of the second rank. 
This contraction can be carried out in only one way: contraction of the tensor AR, on the 
indices i and k or / and m gives zero because of the antisymmetry in these indices, while 


contraction on any other pair always gives the same result, except for sign. We define the 
tensor Rj, (the Ricci tensor) ast 


n E 
R ikl;m = 


Ry = g'"Ry,, = Rig. (92.6) 
According to (91.4), we have: 
TA ôr, LEES 
ik 7 xi 2 EN tT im^ + ias (92.7) 


This tensor is clearly symmetric: 


Rik = Rjj (92.8) 
Finally, contracting R;,. wc obtain the invariant 
R = g^ Ry = gg" Ras, (92.9) 


which is called the scalar curvature of the space. 

The components of the tensor R; satisfy a differential identity obtained by contracting 
the Bianchi identity (92.5) on the pairs of indices ik and /n: 
1 cR 
2x" 

Because of the relations (92.2-4) not all the components of the curvature tensor are 
independent. Let us determine the number of independent components. 

The definition of the curvature tensor as given by the formulas written above applies to 
a space of an arbitrary number of dimensions. Let us first consider the case of two dimen- 
sions, i.e. an ordinary surface; in this case (to distinguish them from four-dimensional 
quantities) we denote the curvature tensor by Pasca and the metric tensor by Yap, where the 
indices a, b. ... run through the values 1, 2. Since in each of the pairs ab and cd the two 
indices must have different values, it is obvious that all the non-vanishing components of 


R! (92.10) 


m;t = 


+ In the literature one also finds another definition of the tensor Rx, using contraction of Rum on the first 
and last indices. This definition differs in sign from the one used here. 
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the curvature tensor coincide or differ in sign. Thus in this case there is only one independent 
component. for example P, ;,;. It is easily found that the scalar curvature is 


2P 
Pe REM hater) (92.11) 
The quantity P/2 coincides with the Gaussian curvature K of the surface: 
P 1 
LU a (92.12) 
2 0102 


where the 0;, 0; are the principal radii of curvature of the surface at the particular point 
(remember that o, and o» are assumed to have the same sign if the corresponding centers 
of curvature are on one side of the surface, and opposite signs if the centers of curvature 
lie on opposite sides of the surface; in the first case K > 0, while in the second K < 0. 

Next we consider the curvature tensor in three-dimensional space; we denote it by P,,,, 
and the metric tensor by 7,5, where the indices a, f run through values 1, 2, 3. The index 
pairs «f! and yó run through three essentially different sets of values: 23, 31, and 12 (per- 
mutation of indices in a pair merely changes the sign of the tensor component). Since the 
tensor P,5,, is symmetric under interchange of these pairs, there are all together 3-2/2 
independent components with different pairs of indices, and three components with 
identical pairs. The identity (92.4) adds no new restrictions. Thus, in three-dimensional 
space the curvature tensor has six independent components. The symmetric tensor Py, 
has the same number. Thus, from the linear relations P,e = g” P asp all the components 
of the tensor P,5,, can be expressed in terms of P,, and the metric tensor y, (see problem 
1). If we choose a system of coordinates that is cartesian at the particular point, then by a 
suitable rotation we can bring the tensor P,, to principal axes.t Thus the curvature tensor 
of a three-dimensional space at a given point is determined by three quantities.§ 

Finally we go to four-dimensional space. The pairs of indices ik and /m in this case run 
through six different sets of values: 01. 02, 03. 23. 31, 12. Thus there are six components 
Of Rin With identical, and 6-5/2 with different, pairs of indices. The latter, however. are 
still not independent of one another; the three components for which all four indices are 
different are related, because of (92.4), by the identity: 


Ror23 + Ro312 + Ro231 = 0. (92.13) 
Thus, in four-space the curvature tensor has a total of twenty independent components. 


t Formula (92.12) is easy to get by writing the equation of the surface in the vicinity of the given point 
(x = y =0) in the form z = (x?/20)) + (y?/29;). 
Then the square of the line element on it is 


2 TT 
dl? = (1 4 =) dx? i (1 JJ dy? +2 2" dy dy. 
o? ei Qı Q2 
Calculation of Py2i2 at the point x = y = 0 using formula (92.1) (in which only terms with second derivatives 
of the yag are needed) leads to (92.12). 

1 For the actual determination of the principal values of the tensor P; there is no nced to transform to a 
coordinate system that is cartesian at the given point. These values can be found by determining the roots 4 
of the equation |Pas — 4y,4| = 0 

§ Knowledge of the tensor P, ,,, enables us to determine the Gaussian curvature K of an arbitrary surface 
in the space. Here we note only that if the x!, x?, x? arc an orthogonal coordinate system, then 


Pins 


711722 — (ia)? 
is the Gaussian curvature for the "plane" perpendicular (at the given point) to the 4? axis; by a "plane" 
we mean a surface formed by geodesic lines. 
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By choosing a coordinate system that is galilcan at the given point and considering the 
transformations that rotate this system (so that the g, at the point are not changed), one 
can achieve the vanishing of six of the components of the curvature tensor (since there are 
six independent rotations of a four-dimensional coordinate system). Thus, in the general 
case the curvature of four-space is determined at each point by fourtcen quantities. 

If R, — 0,f then the curvature tensor has a total of ten independent components in an 
arbitrary coordinate system. By a suitable transformation we can then bring the tensor 
Ri (at the given point of four-space) to a "canonical" form, in which its components are 
expressed in general in terms of four independent quantities; in special cascs this number may 
be even smaller. 

If. however, Ry #0, then the same classification can be used for the curvature tensor 
after one has subtracted from it a particular part that is expressible in terms of the 
components R. Namely, we construct the tensor] 


C ikim = R ikim = Ri Iim + TR gu T Rudin = i Rem Git + tR(gà Gin — Jim Gut) (92. 14) 


It is easy to see that this tensor has all thc symmetry properties of the tensor Ra, but 
vanishes when contracted on a pair of indices (i/ or km). 

Let us show how one classifies the possible types of canonical forms of the curvature 
tensor when R,, = 0. (A. Z. Petrov, 1950). 

We shall assume that the metric at the given point in four-dimensional space has been 
brought to galilean form. We write the set of twenty independent components of the tensor 
Rixim as a collection of three-dimensional tensors defined as follows: 


Aag = Roog- Cup = $6.56 s R oia Bap = le, Rogys (92.15) 


(e,,, is the unit antisymmetric tensor; since the three-dimensional metric is cartesian, 
there is no need to deal with the difference between upper and lower indices in the summa- 
tion), The tensors A,, and C,, are symmetric by definition; the tensor B,, is asymmetric, 
while its trace is zero because of (92.13). According to the definitions (92.15) we have, 
for example. 


By, = Ry By, = Ross, Bi = Roa, Ci, = Ru. 


It is easy to see that the conditions Rym = 9" Rixm = 0 are equivalent to the following 
relations between the components of the tensors (92.15): 


A,, = 9, Bag = Bg. Aap = —Cap. (92.16) 
We also introduce the symmetric complex tensor 


This combining of the two real three-dimensional tensors A,, and B,, into one complex 
tensor corresponds precisely to the combination (in § 25) of the two vectors E and H into 
the complex vector F, while the resulting relation between D,, and the four-tensor Rigim 


t We shall see later ($95) that the curvature tensor forl the gravitational field in vacuum has this property. 
{This complicated expression can be written more compactly in the form: 
Cun = Ruin — Rii Jem t+ Ragen b RI Irm 
where the square brackets imply antisymmetrization over the indices contained in them: 
Aux) = M Au — Aa). 
The tensor (92.14) is called the Weyl tensor. 
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corresponds to the relation between F and the four-tensor F;,. It then follows that four- 
dimensional transformations of the tensor Rium are equivalent to three-dimensional 
complex rotations carried out on the tensor D,,. 

With respect to these rotations one can define eigenvalues A = 4'4 iA" and eigenvectors 
n, (complex, in general) as solutions of the system of equations 


D,gng = An,. (92.18) 


The quantities 4 are the invariants of the curvature tensor. Since the trace D,, = 0, the 
sum of the roots of equation (4) is zero: 


AU 4 40 4 AO) = 0. 

Depending on the number of independent eigenvectors n,, we arrive at the following 
classification of possible cases of reduction of the curvature tensor to the canonical Petroy 
types I-III. 

(I) There are three independent eigenvectors. Then their squares n,n’ are different from 
zero and by a suitable rotation we can bring the tensor D,,, and with it A,, and B,,, to 
diagonal form: 


AO» 0 0 V ye 0 0 
Ag={ 0 42” 0 2  By=l 0 4" 0 . (92.19) 
0 0 —OAOr LL jay 0 0 — Ze pare 


In this case the curvature tensor has four independent invariants.t 
The complex invariants 2"), 2(? are expressed algebraically in terms of the complex 
scalars 


1 ikim " - m 
l, = 48 ( Ru, R^ — i Ru, R"), 
(92.20) 
1 i ` mpr > i 
l= 96 (RimR"” Rp" +i R itm R" R, 5) 


where the asterisk over a symbol denotes the dual tensor: 


* 
Ritim = JE, RP im 
Calculating 7, and 7; using (92.19), we obtain: 
i, = 4(40)2 4 402 4. 2022), la = JADANA 4 201. (92.21) 
These formulas enable us to calculate 2", A? starting from the values of AR, in any 
reference system. 

(II) There are two independent eigenvectors. The square of one of them is then equal 
to zero, so that it cannot be chosen as the direction of one of the coordinate axes. One can, 
however, take it to lie in the x!, x? plane; then n, = in,, n, = 0. The corresponding 
equations (92.18) give: 

D,iD,-À  Dg-iD,-24À 
so that 
Dy, = À- ip, Dz. = å+ ip, Di; - p. 


T The degenerate case when A9” = 4", AO = A” is called type D in the literature. 
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The complex quantity 2 = 4'-F iA" is a scalar and cannot be changed. But the quantity yu 
can be given any nonzero value by a suitable complex rotation; we can therefore, without 
loss of generality, assume it to be real. As a result we get the following canonical type 
for the real tensors A,, and B,,: 


A n 0 A —u 0 0 
Ag= tu ^ 0 |, Bug = 0 A +h 0 F (92.22) 
0.0 -2A 0 0 — 2)" 


In this case there are just two invariants 4' and 4”. Then, in accordance with (92.22), 
I, = 27,1, = 4, so that I? = 22. 

(III) There is just one eigenvector, and its square is zero. All the eigenvalues À are then 
identical and consequently equal to zero. The solutions of equations (92.18) can be brought 
to the form D,, = D,, = Dj; = 0, Dj, = u, Dy, = ip, so that 


0 0 x 000 
Ay ={0 0 0) B,-[0 0 4] (92.23) 
u 0 0 Opo 


In this case the curvature tensor has no invariants at all and we have a peculiar situation: 
the four-space is curved, but there are no invariants which could be used as a measure of 
its curvature. t 


PROBLEMS 


1. Express the curvature tensor P.;,, of three-dimensional space in terms of the second-rank 
tensor Pas. 
Solution: We look for Pas,» in the form 


Pasys = Aa poo — Aao yirt Aso Yav Asy ò 
which satisfies the symmetry conditions; here Aas is some symmetric tensor whose relation to Pas 
is determined by contracting the expression we have written on the indices a and y. We thus find: 


Pa = Aya t Ass, Aa =P, —ł}Pyas, 
and finally, , , , , , , 


P 
Pasys = Pay Yos — Pas Yoyrt Pss Yar— Par Yast 2 (Yao Yay — Yar Ys). 
2. Calculate the components of the tensors Ry, and Ry for a metric in which gy, = 0 for i ^ k. 
Solution: We represent the nonzero components of the metric tensor in the form 
gu = ee, e=], e, = —l. 

The calculation according to formula (92.4) gives the following expressions for the nonzero com- 
ponents of the curvature tensor: 
Ry = &e? EF, x Fe it Fie Fri -FuaFius-Fiaui*ksl; 
Rui = € e (FF í.—Fic-Fuute e? (F, FiF} a aAA ias 


—e; e?Pi x €i Em exti 7 Po Fy m F, ms i * I 
msil 


(no summation over repeated indices!). The subscripts preceded by a comma denote ordinary 
differentiation with respect to the corresponding coordinate. 
Contracting the curvature tensor on two indices, we obtain: 


Re= E (Fa FeFkFaCBFaAFG—Fasitk; 
DE dk 

Ra = X [Ri Fu F? a Fnit eere FE, Fii F a Fri Fa E Fa). 
TT m*ii 


t The same situation occurs in the degenerate case (1l) when 4° = A” = 0; this case is called type N. 
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§ 93. The action function for the gravitational field 


To arrive at the equations determining the gravitational field, it is necessary first to 
determine the action S, for this field. The required equations can then be obtained by varying 
the sum of the actions of field plus material particles. 

Just as for the electromagnetic field, the action S, must be expressed in terms of a scalar 


integral fav—g dQ, taken over all space and over the time coordinate x? between two 
given values. To determine this scalar we shall start from the fact that the equations of the 
gravitational field must contain derivatives of the “potentials” no higher than the second 
(just as is the case for the electromagnetic field). Since the field equations are obtained by 
varying the action, it is necessary that the integrand G contain derivatives of gą no higher 
than first order; thus G must contain only the tensor g;, and the quantities i). 

However, it is impossible to construct an invariant from the quantities g,, and T}, alone. 
This is immediately clear from the fact that by a suitable choice of coordinate system we can 
always make all the quantities Fi, zero at a given point. There is, however, the scalar R 
(the curvature of the four-spacc), which though it contains in addition to the g, and its 
first derivatives also the second derivatives of g. is linear in the second derivatives. Because 


of this linearity, the invariant integral | Ry =g dQ can be transformed by means of Gauss? 
theorem to the integral of an expression not containing the second derivatives. Namely, 


[RV =g dQ can be presented in the form 


Í RV Zg dQ = | G4 Zg dQ + Í A ap. 


where G contains only the tensor g;, and its first derivatives, and the integrand of the second 
integral has the form of a divergence of a certain quantity w“ (the detailed calculation is 
given at the end of this section). According to Gauss’ theorem, this second integral can be 
transformed into an integral over a hypersurface surrounding the four-volume over which 
the integration is carried out in the other two integrals. When we vary the action, the 
variation of the second term on the right vanishes, since in the principle of least action, the 
variations of the field at the limits of the region of integration are zero. Consequently, we 
may write 


5 f R/ -g dQ = ô | GV —g dQ. 


The left side is a scalar; therefore the expression on the right is also a scalar (the quantity G 
itself is, of course, not a scalar). 
The quantity G satisfies the condition imposed above, since it contains only the gj, and 
its derivatives. Thus we may write 
c? (se ? 
ôS, = -gi fo -gdQ= = l6nk 
where & is a new universal constant. Just as was done for the action of the clectromagnetic 
field in $ 27, we can see that the constant A must be positive (see the end of this section). 
The constant K is called the gravitational constant. The dimensions of k follow from (93.1). 
The action has dimensions gm-cm?-sec ^! ; all the coordinates have the dimensions cm, 
the gi, are dimensionless, and so R has dimensions cm~?. As a result, we find that k has 
the dimensions cm?-gm ^ !-sec ^ ?. Its numerical value is 


k = 6.67 x 1075 cm?-gm  !-sec ^". (93.2) 


AL -g dQ, (93.1) 
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We note that we could have set k equal to unity (or any other dimensionless constant). 
However, this would fix the unit of mass.t 

Finally, let us calculate the quantity G of (93.1). From the expression (92.10) for Ry, we 
have 


—— — (ATO SET ul N 
sag R=; =g g^ Ri = V-g {oi aos" EQ * 9^1 ik sv, 


In the first two terms on the right, we have 


Foo, Th 0 FO . ĉ fT ; 
V9 9" "ad 7 sd(N -9 g Vi) - Va sa g g”, 
—— Q04 ô —, Ó —— , 
v—09^ aa = za 9g" Ti) - To za lV g g^). 


Dropping the total derivatives. we find 


E 


F ln m € P4 e [ik m m ik l, 
vog G = Tin ga (V mg g”) Ti a 799) (Tis Tin Tigy —8- 


With the aid of formulas (86.5)-(86.8). we find that the first two terms on the right are 
equal to v —g multiplied by 


2Tf, Tim g”™ 7 Vin 


Tug" — Ta Dmg" = g (20a Ci V Ea — Fi Vm) 
= 2g'( Ti D 7 T Vin): 


Finally, we have 
G = g" (Ti Tis 7 Vie Fm). (93.3) 


The components of the metric tensor are the quantities which determine the gravitational 
field. Therefore in the principle of least action for the gravitational field it is the quantities 
ga Which are subjected to variation. However, it is necessary here to make the following 
fundamental reservation. Namely, we cannot claim now that in an actually realizable field 
the action integral has a minimum (and not just an extremum) with respect to a// possible 
variations of the gą. This is related to the fact that not every change in the g; is associated 
with a change in the space-time metric, i.e. with a real change in the gravitational field. 
The components g; also change under a simple transformation of coordinates connected 
merely with the shift from one system to another in one and the same space-time. Each such 
coordinate transformation is generally an aggregate of four independent transformations. 
in order to exclude such changes in g,, which are not associated with a change in the metric, 
we can impose four auxiliary conditions and require the fulfillment of these conditions under 
the variation. Thus, when the principle of least action is applied to a gravitational field, we 


+ If one sets k = c?, the mass is measured in cm, where 1 cm = 1.35 x 10?? gm. Sometimes one uses in 
place of & the quantity 
8zk 
c 


x= = 1.86 x 10°27 cm gm^!, 


which is called the Einstein gravitational constant. 


270 THE GRAVITATIONAL FIELD EQUATIONS § 94 


can assert only that we can impose auxiliary conditions on the g,,, such that when they are 
fulfilled the action has a minimum with respect to variations of the g,.T 
Keeping these remarks in mind, we now show that the gravitational constant must be 
positive. As the four auxiliary conditions mentioned, we use the vanishing of the three 
components goa, and the constancy of the determinant |g,,| made up from the components 
of gag: 
do,70,  |g,j| = const; 
from the last of these conditions we have 
"LP LN Igal = 0 
ax? Ax? 7" i 
We are here interested in those terms in the integrand of the expression for the action which 
contain derivatives of g;, with respect to x? (cf. p. 68). A simple calculation using (93.3) 
shows that these terms in G are 
zd g^g?99? Gay Ipa 
4 0x9 0x? 
It is easy to see that this quantity is essentially negative. Namely, choosing a spatial system 
of coordinates which is cartesian at a given point at a given moment of time (so that 


aB 
E g” (C i 
4 exo}? 


Gap = g^ = —Ó,,4), we obtain: 
and, since g?? = 1/goo > 0, the sign of the quantity is obvious. 

By a sufficiently rapid change of the components g,, with the time x? (within the time 
interval between the limits of integration of x?) the quantity G can consequently be made 
as large as one likes. If the constant & were negative, the action would then decrease without 
limit (taking on negative values of arbitrarily large absolute magnitude), that is, there could 
be no minimum. 


§ 94. The energy-momentum tensor 


In § 32 the general rule was given for calculating the energy-momentum tensor of any 
physical system whose action is given in the form of an integral (32.1) over four-space. In 
curvilinear coordinates this integral must be written in the form 


1 ae 
s=: Í AY —g dQ (94.1) 


(in galilean coordinates g = —1, and S goes over into f A dV dt). The integration extends 

over all the three-dimensional space and over the time between two given moments, i.e., over 
the infinite region of four-space contained between two hypersurfaces. 

As already discussed in $ 32, the energy-momentum tensor, calculated from the formula 

' (32.5), is generally not symmetric, as it should be. In order to symmetrize it, we had to add 


t We must emphasize, however, that everything we have said has no effect on the derivation of the field 
equations from the principle of least action (8 95). These equations are already obtained as a result of the 
requirement that the action be an extremum (i.e., vanishing of the first derivative), and not necessarily a 
minimum. Therefore in deriving them we can vary all of the gı independently. 
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to (32.5) suitable terms of the form (ĉ/ôx')Yi where Wig = —Win. We shall now give 
another method of calculating the energy-momentum tensor which has the advantage of 
leading at once to the correct expression. 

In (94.1) we carry out a transformation from the coordinates x! to the coordinates 
x = x E, where the £/ are small quantities. Under this transformation the g'* are trans- 
formed according to the formulas: 


Aet Ak ayi k 

i Cx’! 6x . 0€ .. 0€ 
g(x") = g'"(x!) ———— = g!™ | 644+ db T 
g )-29"(x) Axi ox" g l od mt 3x" 


óc* 
ox" 


Axi 
kt eS 
+9 za 
ex 


~ gi(x')+gim 
Here the tensor g'^ is a function of the x’', while the tensor g* is a function of the original 
coordinates x'. In order to represent all terms as functions of one and the same variables, we 
expand g’*(x'+ &') in powers of &'. Furthermore, if we neglect terms of higher order in £, 
we can in all terms containing ¢', replace g’* by g^. Thus we find 
; ; A ae og NU 
g(x!) = gi(x')—eé! = +g" es kl 
Óx 
It is easy to verify by direct trial that the last three terms on the right can be written as a 
sum čt 4+ č“ i of contravariant derivatives of the č. Thus we finally obtain the transforma- 
tion of the g^ in the form l 


g'* 2 g*+ôg*, dgi* = ERR EN (94.2) 
For the covariant components, we have: 
gu = gu t Ógi Óga = = čik ii (94.3) 


(so that, to terms of first order we satisfy the condition gag'*! = ó).t 

Since the action S is a scalar, it does not change under a transformation of coordinates. 
On the other hand, the change 6S in the action under a transformation of coordinates can 
be written in the following form. As in § 32, let g denote the quantities defining the physical 
system to which the action S applies. Under coordinate transformation the quantities q 
change by óq. In calculating 6S we need not write terms containing the changes in q. All 
such terms must cancel each other by virtue of the "equations of motion” of the physical 
system, since these equations are obtained by equating to zero the variation of S with 
respect to the quantities g. Therefore it is sufficient to write the terms associated with changes 
in the gą. Using Gauss’ theorem, and setting 6g = 0 at the integration limits, we find 6S 
in the formt 


t We note that the equations 
Eb ky. Flt = Q 
determine the infinitesimal coordinate transformations that do not change the metric. In the literature these 
are often called the Killing equations. 

t It is necessary to emphasize that the notation of differentiation with respect to the components of the 
symmetric tensor g, which we introduce here, has in a certain sense a symbolic character. Namely, the 
derivative 2F/2g, (F is some function of the gx) actually has a meaning only as the expression of the fact 
that dF = (9F/0g,)dg,,. But in the sum (3F/?gıx)dgıx, the terms with differentials dg,,, of components with 
i+ k, appear twice. Therefore in differentiating the actual expression for F with respect to any definite 
component gy with / x k, we would obtain a value which is twice as large as that which we denote by 
@F/@g1,.. This remark must be kept in mind if we assign definite values to the indices i, k, in formulas in which 
the derivatives with respect to g;, appear. 
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1 f (av—-gA —g ^ . agit 
ss =~ { ar al 49 79^ 520") ag 


c 0g** ox! 
ôx! j 
1 CV-gA 6 6V-gAl]. a 
=- kx x | ôg” dQ. 
c cg ex  .0g 
e ERI 
Ox 
Here we introduce the notation 
| — vV —g ^ 6 0x —g ^ 94.4 
3N g Tar = —u- — zz (94.4) 
2 óg' ex 6g" 
ox! 
Then ôS takes the formt 
! . 1 , - 
ôS = | T,óg* 4 —g dQ = — xl T^óg,  —g dQ (94.5) 


ik 


(note that g ôg = —gu 6g", and therefore T^ ôg = — T dg'*). Substituting for ôg 
the expression (94.2), we have, making use of the symmetry of the tensor 7;,, 


I , , ! "E 
a= x | nitet Lan - t | Tyga do. 


Furthermore, we transform this expression in the foliowing way: 
1 i 1 ; 
=7 Í (T ë) av —g dQ— c Í T} či N —g dQ. (94.6) 


Using (86.9), the first integral can be written in the form 


JE za (V 8 Tič’) dO, 


and transformed into an integral over a hypersurface. Since the č’ vanish at the limits of 
integration, this integral drops out. 
Thus, equating ôS to zero, we find 


1 a 
ós--- Í Tt CV —g dQ =0. 


Because of the arbitrariness of the £' it then follows that 

Ti, = 0. (94.7) 
Comparing this with equation (32.4) 07,,/0x* = 0, valid in galilean coordinates, we see that 
the tensor Ti, defined by formula (94.4), must be identical with the energy-momentum 
tensor—at least to within a constant factor. It is easy to verify, carrying out, for example, 
the calculation from formula (94.4) for the electromagnetic field 


1 . 1 
k 
(^- -ig EE = 7 lón n. Fi Fing"'9 e) 
that this factor is equal to unity. 

t In the case we are considering, the ten quantities ôg are not independent, since they are the result of a 


transformation of the coordinates, of which there are only four. Therefore from the vanishing of óS it does 
not follow that 7, — 0! 
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Thus, formula (94.4) enables us to calculate the energy-momentum tensor by dif- 
ferentiating the function A with respect to the components of the metric tensor (and their 
derivatives). The tensor 7;, obtained in this way is symmetric. Formula (94.4) is convenient 
for calculating the energy-momentum tensor not only in the case of the presence of a gravita- 
tional field, but also in its absence, in which case the metric tensor has no independent 
significance and the transition to curvilinear coordinates occurs formally as an intermediate 
step in the calculation of T;,. 

The expression (33.1) for the energy-momentum tensor of the electromagnetic field must 
be written in curvilinear coordinates in the form 


T, = 5 (~FuFi'+Z FinPau)) (94.8) 
For a macroscopic body the energy-momentum tensor is 
T, = (pe £)u;u, — pga. (94.9) 
We note that the quantity Too is always positive:t 
Too 2 O. (94.10) 


(No general statement can be made about the mixed component T$.) 


PROBLEM 


Consider the possible cases of reduction to canonical form of a symmetric tensor of second rank 
in a pseudo-euclidean space. 
Solution: The reduction of a symmetric tensor A to principal axes means that we find “‘eigen- 
vectors" nt for which 
Aj nh = n. (1) 
The corresponding principal (or **proper") values 2 are obtained from the condition for consistency 
of equation (1), i.e. as the roots of the fourth degree equation 


|Air — Aga =0, (2) 
and are invariants of the tensor. Both the quantities 4 and the eigenvectors corresponding to them 
may be complex. (The components of the tensor Aix itself are of course assumed to be real.) 

From equation (1) it is easily shown in the usual fashion that two vectors 2,“ and 2, which 
correspond to different principal values 4 and 4? are "mutually perpendicular”: 
nin — Q, (3) 
In particular, if equation (2) has complex-conjugate roots 2 and 4*, to which there correspond the 
complex-conjugate vectors n; and n*, then we must have 
n; n'* = 0. (4) 


The tensor A is expressed in terms of its principal values and the corresponding eigenvectors by 


the formula 
HH 


(5) 


Aw —XÀ i 
ACR 


(so long as none of the quantities 1, n’ is equal to zero—cf. below). 


t We have Too = cug : p(uo — goo). The first term is always positive. In the second term we write 

Goo dx? + Gon dx^ 

T ds m 

and obtain after a simple transformation goo p(di/ds)?, where dí is the element of spatial distance (84.6); 


from this it is clear that the second term of Too is also positive. The same result can also be shown for the 
tensor (94.8). 


Ho = goo? 1 go, u* = 
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Depending on the character of the roots of equation (2), the following three situations may occur. 
(a) All four eigenvalues 4 are real. Then the vectors n are also real, and since they are mutually 
perpendicular, three of them must have spacelike directions and one a timelike direction (and are 


normalized by the conditions nr! = —1 and nn! = 1, respectively). Choosing the directions of the 
coordinates along these vectors, we bring the tensor Ay, to the form 
Pu 0 0 0 
0 —20 0 0 
An = 0 on Sa (6) 
0 0 0 —A40 


(b) Equation (2) has two real roots (2, 4?) and two complex-conjugate roots (4'-- i4"). We 
write the complex-conjugate vectors n, n*, corresponding to the last two roots in the form a,4- ibi; 
since they are defined only to within an arbitrary complex factor, we can normalize them by the 
condition n nt = n*n'* = 1. Also using equation (4), we find 

a, a! -- b, bt — 0, a, b! — 0, a, Gi — b, b! = 1, 
So that 
ad! = 3, bib' = —}, 
i.e. one of these vectors must be spacelike and the other timelike.t Choosing the coordinate axes 
along the vectors af, b, n'?", n'3", we bring the tensor to the form: 


A 2M 0 0 
A o— 0 0 

Au = ^ 08^ uae (7) 
0 0 0 — 49 


(c) If the square of one of the vectors x! is equal to zero (7t; zi! = 0), then this vector cannot be 
chosen as the direction of a coordinate axis. We can however choose one of the planes x?, x* so 
that the vector n‘ lies in it. Suppose this is the x°, x! plane; then it follows from nm n! =0 that 
n? — n, and from equation (1) we have Agot Aoi = 4, Aio 411 = — å, SO that Ai = — Ad 
Aoo = A+ 4, Aoi = — 4, where z is a quantity which is not invariant but changes under rotations 
in the x°, x! plane; it can always be made real by a suitable rotation. Choosing the axes x?, x? 
along the other two (spacelike) vectors n?*, n9, we bring the tensor A; to the form 


At+H —Hu 0 0 
=g situ 0 0 
dme NE^ 0 12 0 9 
0 0 0 — A0 


This case corresponds to the situation when two of the roots (49, 4€?) of equation (2) are equal. 
We note that for the physical energy-momentum tensor Tix of matter moving with velocities less 
than the velocity of light only case (a) can occur; this is related to the fact that there must always 
exist a reference system in which the flux of the energy of the matter, i.e. the components Teo are 
equal to zero. For the energy-momentum tensor of electromagnetic waves we have case (c) with 
A® = 4G) = 4 — 0 (cf. p. 82); it can be shown that if this were not the case there would exist a 
reference frame in which the energy flux would exceed the value c times the energy density. 


$ 95. The Einstein equations 


We can now proceed to the derivation of the equations of the gravitational field. These 
equations are obtained from the principle of least action 6(S,,+5S,) = 0, where S, and Sm 
are the actions of the gravitational field and matter respectively. We now subject the 
gravitational field, that is, the quantities g;,, to variation. 


t Since only one of the vectors can have a timelike direction, it then follows that equation (2) cannot 
have two pairs of complex-conjugate roots. 
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Calculating the variation óS,, we have 
5 f R4 =g dQ =ô f g*Ry v —g dQ 
= [ {Ra V — 8 ôg” + Rug" Zg +g" VZ g ÓR,)dO. 
From formula (86.4), we have 
1 — l 
6V-g= -— bg = — LN NT LE 
N g Ne g 2 N 9 9.099 
substituting this, we find 
ô | RV-gdQ= [ (Ru-douR)5g" V -g dQ + [g*$R. V =g dQ. (95.1) 


For the calculation of 6R;, we note that although the quantites Ti, do not constitute a 
tensor, their variations ôl}, do form a tensor, for L4 A4, dx! is the change in a vector under 
parallel displacement [see (85.5)] from some point P to an infinitesimally separated point 
P'. Therefore ôT% A, dx' is the difference between the two vectors, obtained as the result of 
two parallel disolacehents (one with the unvaried, the other with the varied I 1,) from the 
point P to one and the same point P '. The difference between two vectors at the same point 
is a vector, and therefore STi, is a tensor. 

Let us use a locally geodesic system of coordinates. Then at that point all the Ij, = 0. 
With the help of expression (92.10) for the R;,, we have (remembering that the first derivatives 
of the g'* are now equal to zero) 

: . {0 é "E ir ow! 
g" OR, = g^ {= oL - ox oT; JE g^ ox ôli — g] EN D ks "P 
where 
w= g* ori — g” STi. 
Since w' is a vector, we may write the relation we have obtained, in an arbitrary coordinate 
system, in the form 1 


F 6 y— 
g"5Ry = aig ask (V —g w’) 
[replacing dw'/dx' by w' , and using (86.9)]. Consequently the second integral on the right 


side of (95.1) is equal to 
IE KSR N M gdQ= je =g w') dQ, 


and by Gauss’ theorem can be transformed into an integral of w! over the hypersurface 
surrounding the whole four-volume. Since the variations of the field are zero at the integra- 
tion limits, this term drops out. Thus, the variation ôS, is equal tot 
3 
c 


1 — 
= ——— ae, ik f 7 


We note that if we had started from the expression 


3 
S,=- eq | 677 40 


+ We note here the following curious fact. If we calculate the variation ôf R Y —g dQ [with Ru from 
(92.10)], considering the Ij. as independent variables and the gix as constants, and then use expression 
(86.3) for the {:, we would obtain, as one easily verifies, identically zero. Conversely, one could determine 
the relation between the T}, and the metric tensor by requiring that the variation we have mentioned should 
vanish. 
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for the action of the field, then we would have obtained 


e (G4-g ð aGJ-g). 4 
5S == Im —— —— | eg" dQ. 
NT EE a c d je i 
ox! 
Comparing this with (95.2), we find the following relation: 
1 | (aGv—g) 6 &Gv-g 
Rss ge s ANS cH Cona y m , (95.3) 
2 V -g 0g' ox 40g 
0 — 
ax! 


For the variation of the action of the matter we can write immediately from (94.5): 
1 n 
OS, = Xx Í T, ôg” v -—g dQ, (95.4) 


where T; is the energy-momentum tensor of the matter (including the electromagnetic field). 
Gravitational interaction plays a role only for bodies with sufficiently large mass (because 
of the smallness of the gravitational constant), and therefore in studying the gravitational 
field we usually have to deal with macroscopic bodies. Corresponding to this we must usually 
write for 7;, the expression (94.9). 

Thus, from the principle of least action 6S,,+65S, = 0 we find: 


e? l Srk. . 
= exl (Ra- 594R- s Ta) óg^ 4 =g dQ — 0, 


from which, in view of the arbitrariness of the og": 


l 8nk 
Rix 2 gi R — c Ti (95.5) 
or, in mixed components, 
Ri- ; ôi R = ^ T (95.6) 


These are the required equations of the gravitational field—the basic equation of the general 
theory of relativity. They are called the Einstein equations. 
Contracting (95.6) on the indices i and k, we find 


-——T; (95.7) 
(T = Ti). Therefore the equations of the field can also be written in the form 


8nzk f, ! 
Ry = PS (r.- 2 Jik r). (95.8) 

The Einstein equations are nonlinear. Therefore for gravitational fields the principle of 
superposition is not valid. The principle is valid only approximately for weak fields which 
permit a linearization of the Einstein equations (in particular, the gravitational field in the 
classical Newtonian limit, cf. § 99). 
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In empty space 77, = 0, and the equations of the gravitational field reduce to the equation 

Ry = 0. (95.9) 

We mention that this does not at all mean that in vacuum, spacetime is flat; for tiis we 
would need the stronger conditions Ri,,, = 0. 

The energy-momentum tensor of the electromagnetic field has the property that Ti = 0 
[see (33.2)]. From (95.7), it follows that in the presence of an electromagnetic field without 
any masses the scalar curvature of spacetime is zero. 

As we know, the divergence of the energy-momentum tensor is zero: 

Tki = 0; (95.10) 
therefore the divergence of the left side of equation (95.6) must be zero. This is actually the 
case because of the identity (92.13). 

Thus the equation (95.10) is essentially contained in the field equations (95.6). On the 
other hand, the equation (95.10), expressing the law of conservation of energy and momen- 
tum, contains the equation of motion of the physical system to which the energy-momentum 
tensor under consideration refers (i.e., the equations of motion of the material particles or 
the second pair of Maxwell equations). Thus the equations of the gravitational field also 
contain the equations for the mat*?r which produces this field. Therefore the distribution 
and motion of the matter producing the gravitational field cannot be assigned arbitrarily. 
On the contrary, they must be determined (by solving the field equations under given initial 
conditions) at the same time as we find the field produced by the matter. 

We call attention to the difference in principle between the present situation and the one 
we had in the case of the electromagnetic field. The equations of that field (the Maxwell 
equations) contain only the equation of conservation of the total charge (the continuity 
equation), but not the equations of motion of the charges themselves. Therefore the distribu- 
tion and motion of the charges can be assigned arbitrarily, so long as the total charge is 
constant. Assignment of this charge distribution then determines, through Maxwell's 
equations, the electromagnetic field produced by the charges. 

We must, however, make it clear that for a complete determination of the distribution 
and motion of the matter in the case of the Einstein equations one must still add to them the 
equation of state of the matter, i.e. an equation relating the pressure and density. This 
equation must be given along with the field equations.t 

The four coordinates x‘ can be subjected to an arbitrary transformation. By means of these 
transformations we can arbitrarily assign four of the ten components of the tensor gi. 
Therefore there are only six independent quantities gą. Furthermore, the four components 
of the four-velocity u’, which appear in the energy-momentum tensor of the matter, are 
related to one another by u'u; = 1, so that only three of them are independent. Thus we have 
ten field equations (95.5) for ten unknowns, namely, six components of g;,, three components 
of v, and the density e/c? of the matter (or its pressure p). 

For the gravitational field in vacuum there remain a total of six unknown quantities 
(components of gą) and the number of independent field equations is reduced corres- 
pondingly: the ten equations R;, = 0 are connected by the four identities (92.10). 


t Actually the equation of state relates to one another not two but three thermodynamic quantities, for 
example the pressure, density and temperature of the matter. In applications in the theory of gravitation, this 
point is however not important, since the approximate equations of state used here actually do not depend 
on the temperature (as, for example, the equation p = 0 for rarefied matter, the limiting extreme-relativistic 
equation p = e/3 for highly compressed matter, etc.). 
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We mention some peculiarities of the structure of the Einstein equations. They are a 
system of second-order partial differential equations. But the equations do not contain the 
time derivatives of all ten components g;,,. In fact it is clear from (92.1) that second derivatives 
with respect to the time are contained only in the components Roso g of the curvature tensor, 
where they enter in the form of the term —4g,, (the dot denotes differentiation with 
respect to x°); the second derivatives of the components go, and goo do not appear at all. 
It is therefore clear that the tensor R, which is obtained by contraction of the curvature 
tensor, and with it the equations (95.5), also contain the second derivatives with respect to 
the time of only the six spatial components g, ;. 

It is also easy to see that these derivatives enter only in the ^-equation of (95.6), i.e. the 
equation 

8nk 


Ra ôR = ~| Ti (95.11) 
The $ and ? equations, i.e. the equations 
8nk 8nk 
Ro-4R= x T, R= A Te, (95.12) 


contain only first-order time derivatives. One can verify this by checking that in forming the 
quantities R? and R8—4R = I(RQ— R2) from Ry, by contraction, the components of the 
form Ro, actually drop out. This can be seen even more simply from the identity (92.10), 
by writing it in the form 


(RP—399R), o = — (R1 ER). (95.13) 


(i = 0, 1, 2, 3). The highest time derivatives appearing on the right side of this equation are 
second derivatives (appearing in the quantities Rf, R). Since (95.13) is an identity, its left 
side must consequently contain no time derivatives of higher than second order. But one 
time differentiation already appears explicitly in it; therefore the expressions RP—46°R 
themselves cannot contain time derivatives of order higher than the first. 

Furthermore, the left sides of equations (95.12) also do not contain the first derivatives 
Goa and doo (but only the derivatives g,,). In fact, of all the T; xn only Ta, o9 and To, o6 
contain these quantities, but these latter in turn appear only in the components of the 
curvature tensor of the form Rogog which, as we already know, drop out when we form the 
left sides of equations (95.12). 

If one is interested in the solution of the Einstein equations for given initial conditions 
(in the time), we must consider the question of the number of quantities for which the initial 
spatial distribution can be assigned arbitrarily. 

The initial conditions for a set of equations of second order must include both the 
quantities to be differentiated as well as their first time derivatives. But since in the present 
case the equations contain second derivatives of only the six gap, not all the g and dj, can 
be arbitrarily assigned. Thus, we may assign (in addition to the velocity and density of the 
matter) the initial values of the functions gag and ġ „g, after which the four equations (95.12) 
determine the admissible initial values of go, and goo; in (95.11) the initial values of go, 
still remain arbitrary. 

Among the initial conditions thus assigned there are some functions whose arbitrariness 
is related simply to the arbitrariness in choice of the four-dimensional coordinate system. 
But the only thing that has real physical meaning is the number of “physically different" 
arbitrary functions, which cannot be reduced by any choice of coordinate system. From 
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physical arguments it is easy to see that this number is eight: the initial conditions must 
assign the distribution of the matter density and of its three velocity components, and also 
of four other quantities characterizing the free gravitational field in the absence of matter 
(see later in § 107); for the free gravitational field in vacuum only the last four quantities 
should be fixed by the initial conditions. 


PROBLEM 


Write the equations for a constant gravitational field, expressing all the operations of differentia- 
tion with respect to the space coordinates as covariant derivatives in a space with the metric yas 
(84.7). 

Solution: We introduce the notation goo == A, Joa = —hga (88.11) and the three-dimensional 
velocity v“ (88.10). In the following all operations of raising and lowering indices and of covariant 
differentiation are carried out in the three-dimensional space with the metric yes, on the three- 
dimensional vectors ga, t“ and the three-dimensional scalar h. 

The desired equations must be invariant with respect to the transformation 

xt x", xe — x? + f(x), (1) 
which does not change the stationary character of the field. But under such a transformation, as is 
easily shown (see the footnote on p. 248), ga—>ga—4f/@x", while the scalar A and the tensor 
Yas = —Gaat+hgages are unchanged. It is therefore clear that the required equations, when expressed 
in terms of yes, h and ga, can contain ga only in the form of combinations of derivatives that con- 
stitute a three-dimensional antisymmetric tensor: 

[2 [7 
fea =r; agaa = LT as (2) 
which is invariant under such transformations. Taking this fact into account, we can drastically 
simplify the computations by setting (after computing all the derivatives appearing in Riu.) ga = 0 
and ga: s +ga: « = O-t 
The Christoffel symbols are: 


Tj, = 5 g'h: o 

Tj, = hs, 

PER EL 2 

rs, Ms jo s, 

r= GE E) rait tht 


h 
D = Myr 5 (of 0 gf e+ ttt 


The terms omitted (indicated by the dots) are quadratic in the components of ga; these terms do 
drop out when we set ga = 0 after performing the differentiations in Ry (92.10). In the calculations 
one uses formulas (84.9), (84.12-13); the 4%, are three-dimensional Christoffel symbols constructed 
from the metric yas. 


t To avoid any misunderstanding we emphasize that this simplified method for making the computations, 
which gives the correct field equations, would not be applicable to the calculation of arbitrary components 
of the Ri, itself, since they are not invariant under the transformation (1). In equations (3)-(5) on the left 
are given those components of the Ricci tensor which are actually equal to the expressions given. These 
components are invariant under (1). 
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The tensor 7i, is calculated using formula (94.9) with the u! from (88.14) (where again we set 
Ga — 0). 
As a result of the oe we obtain hi following equations from (95.8): 


1 4 7 dus etp  &—p 
i Ro= (Vh) ta faf B= “Sao a (3) 
la 
1 Vh = pte v* 
ae oi fa (T4 = S EO 
Vh R$ 2 f**:5 af (Vh); 8 m c (4) 
h Sak | (pepo | e—p 
a8 — ag _ far £8 0:8 — —* yaB 
R” = pergo fette y; Wi -T T zo”) (5) 
c? 


Here P^ is a three-dimensional tensor constructed from the yas in the same way as R is con- 
structed from the gix.f 


§ 96. The energy-momentum pseudotensor of the gravitational field 


In the absence of a gravitational field, the law of conservation of energy and momentum 
of the material (and electromagnetic field) is expressed by the equation OT ‘*/dx* = 0. 
The generalization of this equation to the case where a gravitational field is present is 
equation (94.7): 


k 
RN ATEN =9) 1990 pu _ 9 mes 


In this form, however, this equation does not generally express any conservation law 
whatever.{ This is related to the fact that in a gravitational field the four-momentum of the 
matter alone must not be conserved, but rather the four-momentum of matter plus gravita- 
tional field; the latter is not included in the expression for TE 

To determine the conserved total four-momentum for a gravitational field plus the matter 
located in it, we proceed as follows (L. D. Landau and E. M. Lifshitz, 1947). We choose 
a system of coordinates of such form that at some particular point in spacetime all the 


t The Einstein equations can also be written in an analogous way for the general case of a time-dependent 
metric. In addition to space derivatives they will also contain time derivatives of the quantities yas, ga, and h. 
See A. L. Zel’manov, Doklady Acad. Sci., U.S.S.R. 107, 815 (1956). 

1 Because the integral f T! ,/ — g dS, is conserved only if the condition 

AVIT!) _ 
ax* 
is fulfilled, and not (96.1). This is easily verified by carrying out in curvilinear coordinates all those cal- 
culations which in § 29 were done in galilean coordinates. Besides it is sufficient simply to note that these 
calculations have a purely formal character not connected with the tensor properties of the corresponding 
quantities, like the proof of Gauss’ theorem, which has the same form (83.17) in curvilinear as in cartesian 
coordinates. 

§ One might get the notion to apply to the gravitational field the formula (94.4), substituting A = 
—(e*/16nk)G. We emphasize, however, that this formula applies only to physical systems described by 
quantities g different from the gix; therefore it cannot be applied to the gravitational field which is determined 
by the quantities gix themselves. Note, by the way, that upon substituting G in place of A in (94.4) we would 
obtain simply zero, as is immediately clear from the relation (95.3) and the equations of the field in vacuum. 
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first derivatives of the g;, vanish (the g;, need not, for this, necessarily have their galilean 
values). Then at this point the second term in equation (96.1) vanishes, and in the first 
term we can take ./—g out from under the derivative sign, so that there remains 


ô 
aa T= 0, 


or, in contravariant components, 


Quantities T*, identically satisfying this equation, can be written in the form 


- 9 nik! 
ox 


ik 


where the 5" are quantities antisymmetric in the indices k, /; 
nit! = — 
Actually it is not difficult to bring T* to this form. To do this we start from the field 
equation 


and for R we have, according to (92.1) 


Rit = L gimgtogi 891» + gn _ Im _ a'I mp 
2 ôx" Ox" ax! Ax? Ox"Ox" — Gx! Ax" 


(we recall that at the point under consideration, all the r}, = 0). After simple transforma- 
tions the tensor 7" can be put in the form 


a fht ; 
TK = — ik g"— il km } 

The expression in the curly brackets is antisymmetric in k and /, and is the quantity 
which we designated above as n“!. Since the first derivatives of gą are zero at the point 
under consideration, the factor 1/(—g) can be taken out from under the sign of differentia- 
tion 0/Ox'. We introduce the notation 


ri = sain (96.2) 
Am = n — (— agg im —gi'gk™) (96.3) 
The quantities h are antisymmetric in k and /: 
h ikl — pit. (96.4) 
Then we can write 
oni 
<> =(-g)T*. 


ex! 
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This relation, derived under the assumption 0g,,/@x' = 0, is no longer valid when we go 
to an arbitrary system of coordinates. In the general case, the difference dh'"'/Ax' — ( — g)T* 
is different from zero; we denote it by (—g)t™. Then we have, by definition, 


n TM EE ÔR 
(-9gY(T"*17) = Tr. (96.5) 
Óx 
The quantities 1 * are symmetric in / and k: 
t = pk, (96.6) 
This is clear immediately from their definition, since like the tensor T^, the derivatives 


êh"! Cx! are symmetric quantities.* Expressing T in terms of R^, according to the Einstein 
equations, we get the identity 

Qhi* 
ôx!’ 


from which, after a rather lengthy calculation, we find the following expression for /^: 


4 
(-9) lis (R^ - ig Re = (96.7) 


; c ; : 
th = T (Orin TTA STILES Xg'g'* —g^g'")- 
Tg g" (TIE mn Sr m Tim T) + 
+ go", Iz an p rf, E Tip Tin z Dis [2 T 
+g" gT nT np Dis Tap) (96.8) 
or, in terms of derivatives of the components of the metric tensor, 
4 
i C i i um i n m 
(-g)" = l6nk (a* .g'". m— 9". 10" * mt49g "dim 9 ‘ear nT 


— (9"g,, 9^, p3 1+9" Gmn 9^. ,9"^ )) - giu g"" 9". ng", pt 
+ 4Qg"g'" — g^g'")9, 9, — I pa Inr)9" 19", m) (96.9) 
where git = y —g g*, while the index .i denotes a simple differentiation with respect to x‘. 
An essential property of the 1'* is that they do not constitute a tensor; this is clear from 
the fact that in OA'"/Ox! there appears the ordinary, and not the covariant derivative. 
However, 1'* is expressed in terms of the quantities IT}, and the latter behave like a tensor 
with respect to linear transformations of the coordinates (see § 85), so the same applies 
to the z". 
From the definition (96.5) it follows that for the sum 7*-- t'* the equation 


ô NES 
3a CC XT" +H) = 0 (96.10) 
is identically satisfied. This means that there is a conservation law for the quantities 
1 
Pint Í (-gX T" +1" ds,. (96.11) 


In the absence of a gravitational field, in galilean coordinates, £ = 0, and the integral 
we have written goes over into (1/c) f T^ dS,, that is, into the four-momentum of the 


t For just this purpose we took (—g) out from under the derivative sign in the expression for 7. If this 
had not been done, OA'*'/Qx! and therefore also /'* would turn out not to be symmetric in j and k. 
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material. Therefore the quantity (96.11) must be identified with the total four-momentum 
of matter plus gravitational field. The set of quantities r'* is called the energy-momentum 
pseudo-tensor of the gravitational field. 

The integration in (96.11) can be taken over any infinite hypersurface, including all of 
the three-dimensional space. If we choose for this the hypersurface x? = const, then Pi 
can be written in the form of a three-dimensional space integral: 


Pi =i f (oT? 40) av. (96.12) 


This fact, that the total four-momentum of matter plus field is expressible as an integral 
of the quantity (—g) (T*-- 1'*) which is symmetric in the indices i, k, is very important. It 
means that there is a conservation law for the angular momentum, defined as (see § 32)f 


Mi = f (xi dP*—x* dP’) = : Í QT" +) xT" +g) dS (96.13) 


Thus, also in the general theory of relativity, for a closed system of gravitating bodies 
the total angular momentum is conserved, and, moreover, one can again define a center of 
inertia which carries out a uniform motion. This latter point is related to the conservation 
of the components M°* (see § 14) which is expressed by the equation 


x? | ar? «£9 g) av — f (799 + 1°°-g) av = const, 
so that the coordinates of the center of inertia are given by the formula 
f x +X- g) av 


X = ; (96.14) 
f (199: 7 g) av 


By choosing a coordinate system which is inertial in a given volume element, we can 
make all the t ^ vanish at any point in space-time (since then all the Ij, vanish). On the other 
hand, we can get values of the t/* different from zero in flat space, i.e. in the absence of a 
gravitational field, if we simply use curvilinear coordinates instead of cartesian. Thus, in 
any case, it has no meaning to speak of a definite localization of the energy of the gravita- 
tional field in space. If the tensor 7;, is zero at some world point, then this is the case for 
any reference system, so that we may say that at this point there is no matter or electro- 
magnetic field. On the other hand, from the vanishing of a pseudo-tensor at some point in 
one reference system it does not at all follow that this is so for another reference system, so 
that it is meaningless to talk of whether or not there is gravitational energy at a given place. 
This corresponds completely to the fact that by a suitable choice of coordinates, we can 
“annihilate” the gravitational field in a given volume element, in which case, from what 
has been said, the pseudotensor 1‘* also vanishes in this volume element. 

The quantities P' (the four-momentum of field plus matter) have a completely definite 


t It is necessary to note that the expression obtained by us for the four-momentum of matter plus field is 
by no means the only possible one. On the contrary, one can, in an infinity of ways (see, for example, the 
problem in this section), form expressions which in the absence of a field reduce to T*, and which upon 
integration over dSx, give conservation of some quantity. However, the choice made by us is the only one for 
which the energy-momentum pseudotensor of the field contains only first (and not higher) derivatives of gu 
(a condition which is completely natural from the physical point of view), and is also symmetric, so that it is 
possible to formulate a conservation law for the angular momentum. 
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meaning and are independent of the choice of reference system to just the extent that is 
necessary on the basis of physical considerations. 

Let us draw around the masses under consideration a region of space sufficiently large 
so that outside of it we may say that there is no gravitational field. In the course of time, this 
region cuts out a "channel" in four-dimensional space-time. Outside of this channel there 
is no field, so that four-space is flat. Because of this we must, when calculating the energy 
and momentum of the field, choose a four-dimensional reference system such that outside 
the channel it goes over into a galilean system and all the 1'* vanish. 

By this requirement the reference system is, of course, not at all uniquely determined— 
it can still be chosen arbitrarily in the interior of the channel. However the P, in full accord 
with their physical meaning, turn out to be completely independent of the choice of co- 
ordinate system in the interior of the channel. Consider two coordinate systems, different in 
the interior of the channel, but reducing outside of it to one and the same galilean system, 
and compare the values of the four-momentum P! and P" in these two systems at definite 
moments of “time” x? and x'?. Let us introduce a third coordinate system, coinciding in 
the interior of the channel at the moment x? with the first system, and at the moment x'? 
with the second, while outside of the channel it is galilean. But by virtue of the law of con- 
servation of energy and momentum the quantities P' are constant (dP'/dx? = 0). This is 
the case for the third coordinate system as well as for the first two, and from this it follows 
that P! = P", 

Earlier it was mentioned that the quantities r“ behave like a tensor with respect to linear 
transformations of the coordinates. Therefore the quantities P' form a four-vector with 
respect to such transformations, in particular with respect to Lorentz transformations 
which, at infinity, take one galilean reference frame into another.t The four-momentum P' 
can also be expressed as an integral over a distant three-dimensional surface surrounding 
“all space". Substituting (96.5) in (96.11), we find 

TE anit! 
P' = zl ax dS,- 
This integral can be transformed into an integral over an ordinary surface by means of 
(6.17): 


| 
P! = =< h* afi. (96.15) 


If for the surface of integration in (96.11) we choose the hypersurface x° = const.. then 
in (96.15) the surface of integration turns out to be a surface in ordinary space. 


P = : $ hi df. (96.16) 


+ Strictly speaking, in the definition (96.11) P' is a four-vector only with respect to linear transformations 
with determinant equal to unity; among these are the Lorentz transformations, which alone are of physical 
interest. If we also admit transformations with determinant not equal to unity, then we must introduce into 
the definition of P' the value of g at infinity by writing “=g. P' in place of P' on the left side of (96.11). 

$ The quantity df,” is the "normal" to the surface element, related to the “tangential” element df™ by 
(6.11): df = kei... df'". On the surface bounding the hypersurface which is perpendicular to the x? axis 
the only nonzero components of df '"" are those with /, m = 1, 2, 3, and so df:* has only those components, 
in which one of i and k is 0. The components df,? are just the components of the three-dimensional element 
of ordinary surface, which we denote by df,. 
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To derive the analogous formula for the angular momentum, we substitute (96.5) in 
(96.13) and write A! in the form (96.2). Integrating by parts, we obtain: 


2 3kimn 2 3ilmn 
went (o 85 pF 


c Ox" 6x ur ox" 
1 i gaum BET i gam 7 ojitma 
= 2c s Ox" z =) afi, al Q ax" — 5, ax" ^m) aS, 


1 : : [7 TES 
LI ipkim _ k piim d * L f klin — 4ilkn . 
;, | ernim ars - = f s armi as, 
From the definition of the quantities 2'*'" it is easy to see that 


ilkn klin |. jilnk sinik — tink 
Ar Ae A, AT = A, 


Thus the remaining integral over dS, is equal to 


gA” 
{Ge ase man. 


Finally, again choosing a purely spatial surface for the integration, we obtain: 


; 1 ; i 
M* = : Í (x! hko* — x nio + A12) df,. (96.17) 


PROBLEM 


Find the expression for the total four-momentum of matter plus gravitational field, using 
formula (32.5). 
Solution: In curvilinear coordinates one has, in place of (32.1), 
S — [ A V —g dV dt, 


and therefore to obtain a quantity which is conserved we must in (32.5) write A V —g in place 
of A, so that the four-momentum has the form 


1 may 
P => Í -AVT +E T ál TEM dS. 
? ax 
In applying this formula to matter, for which the quantities g® are different from the gu, we can 
take V —g out from under the sign of differentiation, and the integrand turns out to be equal to 


V = Tt, where T* is the energy-momentum tensor of the matter. When applying this same 
formula to the gravitational field, we must set A = —(c*/162k)G, while the quantities g are the 
components gx of the metric tensor. The total four-momentum of field plus matter is thus equal to 


ol ; pi 0g" AG v —g) 
ax* 
Using the expression (93.3) for G, we can rewrite this expression in the form: 
ELM V5 garp VED _ r, Ko VON) as 
p=} ffr zr g TS, os ri, zd 
The second term in the curly brackets gives the four-momentum of the gravitational field in the 


absence of matter. The integrand is not symmetric in the indices i, k, so that one cannot formulate 
à law of conservation of angular momentum. 
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§ 97. The synchronous reference system 


As we know from § 84, the condition for it to be possible to synchronize clocks at dif- 
ferent points in space is that the components gy, of the metric tensor be equal to zero. If, 
in addition, go = 1, the time coordinate x? = t is the proper time at each point in space.t 
A reference system satisfying the conditions 


Joa = O, goo — 1l (97.1) 

is said to be synchronous. The interval element in such a system is given by the expression 
ds? = di? — y, dx* dx’, (97.2) 

where the components of the spatial metric tensor are the same (except for sign) as the g, ,: 


Yap = — fap. (97.3) 


In the synchronous reference system the time lines are geodesics in the four-space. The 
four-vector ut = dx'/ds, which is tangent to the world line x!, x?, x? = const, has com- 
ponents u* = 0, u? = 1, and automatically satisfies the geodesic equations: 

i 
du +Ti utu! =Ti, =0, 
ds 
since, from the conditions (99.1), the Christoffel symbols 3, and T8, vanish identically. 

It is also easy to see that these lines are normal to the hypersurfaces t = const. In fact, 
the four-vector normal to such a hypersurface, n; = ót/Óx', has covariant components 
no = 1, n, — 0. With the conditions (97.1), the corresponding contravariant components 
are also n? = 1, në = 0, i.e., they coincide with the components of the four-vector u’ which is 
tangent to the time lines. 

Conversely, these properties can be used for the geometrical construction of a syn- 
chronous reference system in any space-time. For this purpose we choose as our starting 
surface any spacelike hypersurface, i.e., a hypersurface whose normals at each point have a 
time-like direction (they lie inside the light cone with its vertex at this point); all elements of 
interval on such a hypersurface are spacelike. Next we construct the family of geodesic lines 
normal to this hypersurface. If we now choose these lines as the time coordinate lines and 
determine the time coordinate t as the length s of the geodesic line measured from the 
initial hypersurface, we obtain a synchronous reference system. 

It is clear that such a construction, and the selection of a synchronous reference system, is 
always possible in principle. Furthermore, this choice is still not unique. A metric of the 
form (97.2) allows any transformation of the space coordinates which does not affect the 
time, and also transformations corresponding to the arbitrariness in the choice of the 
initial hypersurface for the geometrical construction. 

The transformation to the synchronous reference system can, in principle, be done 
analytically by using the Hamilton-Jacobi equation; the basis of this method is the fact that 
the trajectories of a particle in a gravitational field are just the geodesic lines. 

The Hamilton-Jacobi equation for a particle (whose mass we set equal to unity) in a 


gravitational field is ae a 
4, Ot Ot 
git axl A = 1, (97.4) 


t In this section we set c = 1. 
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(where we denote the action by t). Its complete integral has the form: 
T = f(x) AQ?) (97.5) 


where f is a function of the four coordinates x' and the three parameters £*; the fourth 
constant A we treat as an arbitrary function of the three €*. With such a representation for t, 
the equations for the trajectory of the particle can be obtained by equating the derivatives 
01/0" to zero, i.e. 


n7 xa (97.6) 


For each set of assigned values of the parameters £^, the right sides of equations (97.6) 
have definite constant values, and the world line determined by these equations is one of the 
possible trajectories of the particle. Choosing the quantities £^, which are constant along 
the trajectory, as new space coordinates, and the quantity t as the new time coordinate, we 
get the synchronous reference system; the transformation which takes us from the old 
coordinates to the new is given by equations (97.5-6). In fact it is guaranteed that for such a 
transformation the time lines will be geodesics and will be normal to the hypersurfaces 
t = const. The latter point is obvious from the mechanical analogy: the four-vector — ĝt/ôx' 
which is normal to the hypersurface coincides in mechanics with the four-momentum of the 
particle, and therefore coincides in direction with its four-velocity uw, i.e. with the four- 
vector tangent to the trajectory. Finally the condition goo = 1 is obviously satisfied, since the 
derivative — dt/ds of the action along the trajectory is the mass of the particle, which we set 
equal to 1; therefore |dt/ds| = 1. 

We write the Einstein equations in the synchronous reference system, separating the 
operations of space and time differentiation in the equations. 

We introduce the notation 


OYap 

Xap = Z 97.7 
af et ( ) 
for the time derivatives of the three-dimensional metric tensor; these quantities also form a 
three-dimensional tensor. All operations of shifting indices and covariant differentiation of 
the three-dimensional tensor x,, will be done in three-dimensional space with the metric 


Yap-+ We note that the sum x; is the logarithmic derivative of the determinant y = [yal  — g: 
oy 6 

a = yh — — — In (y). 97.8 

vg = y! D = = In (9) (97.8) 


For the Christoffel symbols we find the expressions: 
To = Too = To. = 0, 

TY = dep, Tog =4x, Thy = Ab (97.9) 

where 4%, are the three-dimensional Christoffel symbols formed from the tensor y,,. A 


calculation using formula (92.7) gives the following expressions for the components of the 


+ But this does not, of course, apply to operations of shifting indices in the space components of the four- 
tensors Ry, Tx (see the footnote on p. 251). Thus T? must be understood as before to be g^'T,. F g"°Toa, 
which in the present case reduces to g^"7,, and differs in sign from y^'T,,. 
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tensor Rj, 
180 1 
Roo = — 2 ap EL 
1 
Ro, = 5 66a hs al, "m 
18 1 . | 
w= 55 Xap t n (Xap Xi — 2201 x) + Pag. 


Here P, is the three-dimensional Ricci tensor which is expressed in terms of y, , in the same 
way as R,, is expressed in terms of gą. All operations of raising indices and of covariant 
differentiation are carried out with the three-dimensional metric 7,,. 

We write the Einstein equations in mixed components: 


10 , ] z 
R? = — 55 Men a4 = Bnk (To —AT), (97.11) 
1 
R? = 5 (xf; p— xb: a) = 81k T2, (97.12) 
p B 1 0g B p 
RÈ = — Pi — ———(N y x1) 2 8nk(Th— 30^ T). (97.13) 
2Vy ôt 


A characteristic feature of synchronous reference systems is that they are not stationary: 
the gravitational field cannot be constant in such a system. In fact, in a constant field we 
would have x,, = 0. But in the presence of matter the vanishing of all the x,, would con- 
tradict (97.11) (which has a right side different from zero). In empty space we would find 
from (97.13) that all the P,,, and with them all the components of the three-dimensional 
curvature tensor P, ,,,, vanish, i.e. the field vanishes entirely (in a synchronous system with 
a euclidean spatial metric the space-time is flat). 

At the same time the matter filling the space cannot in general be at rest relative to the 
synchronous reference frame. This is obvious from the fact that particles of matter within 
which there are pressures generally move along lines that are not geodesics; the world line 
of a particle at rest is a time line, and thus is a geodesic in the synchronous reference system. 
An exception is the case of “dust” (p = 0). Here the particles interacting with one another 
will move along geodesic lines; consequently, in this case the condition for a synchronous 
reference system does not contradict the condition that it be comoving with the matter.t 
For other equations of state a similar situation can occur only in special cases when the 
pressure gradient vanishes in all or in certain directions. 

From (97.11) one can show that the determinant —g = y of the metric tensor in a syn- 
chronous system must necessarily go to zero in a finite length of time. 

To prove this we note that the expression on the right side of this equation is positive 


f Even in this case, in order to be able to choose a “synchronously comoving” system of reference, it is 
still necessary that the matter move “without rotation". 1n the comoving system the contravariant components 
of the velocity are u? = 1, u° = Q. If the reference system is also synchronous, the covariant components 
must satisfy up = 1, Ha = Q, so that its four-dimensional curl must vanish: 

_ L0 Ök — 
Hi: k HU: = ax* ex! 
But this tensor equation must then also be valid in any other reference frame. Thus, in a synchronous, but 
not comoving, system, we then get the condition curl v = 0 for the three-dimensional velocity v. 
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for any distribution of the matter. In fact, in a synchronous reference system we have for 
the energy-momentum tensor (94.9): 
To? 
T)-1T = \c+3p)+ peer 
[the components of the four-velocity are given by (88.14)]; this quantity is clearly positive. 
The same statement is also true for the energy-momentum tensor of the electromagnetic 
field (T = 0, T$ is the positive energy density of the field). Thus we have from (97.11): 


xixi <0 (97.14) 


(where the equality sign applies in empty space). 
Using the algebraic inequality f 
xps > 308) 
wc can rewrite (97.14) in the form 


a 
ap 62 «0 


a/1\_1 
ala) 2& (97.15) 


Suppose, for example, that at a certain time x; > 0. Then as ż decreases the quantity 1/x2 
decreases, with a finite (nonzero) derivative, so that it must go to zero (from positive values) 
in the course of a finite time. In other words, x2 goes to + œ, but since x% = à In y/dt, this 
means that the determinant y goes to zero [no faster than t$, according to the inequality 
(97.15)]. If on the other hand x3 < 0 at the initial time, we get the same result for increasing 
times. 

This result does not, however, by any means prove that there must be a real physical 
singularity of the metric. A physical singularity is one that is characteristic of the space- 
time itself, and is not related to the character of the reference frame chosen (such a singularity 
should be characterized by the tending to infinity of various scalar quantitiés—the matter 
density, or the invariants of the curvature tensor). The singularity in the synchronous 
reference system, which we have proven to be inevitable, is in general actually fictitious, 
and disappears when we change to another (nonsynchronous) reference frame. Its origin 
is evident from simple geometrical arguments. 

We saw earlier that setting up a synchronous system reduces to the construction of a 
family of geodesic lines orthogonal to any space-like hypersurface. But the geodesic lines 
of an arbitrary family will, in general, intersect one another on certain enveloping hyper- 
surfaces—the four-dimensional analogues of the caustic surfaces of geometrical optics. 
We know that intersection of the coordinate lines gives rise to a singularity of the metric in 
the particular coordinate system. Thus there is a geometrical reason for the appearance of a 
singularity, associated with the specific properties of the synchronous system and therefore 
not physical in character. In general an arbitrary metric of four-space also permits the 
existence of nonintersecting families of geodesic lines. The unavoidable vanishing of the 
determinant in the synchronous system means that the curvature properties of a real (non- 
flat) space-time (which are expressed by the inequality R9 > 0) that are permitted by the 


or 


1 Its validity can easily be seen by bringing the tensor x? to diagonal form (at a given instant of time). 


290 THE GRAVITATIONAL FIELD EQUATIONS § 97 


field equations exclude the possibility of existence of such families, so that the time lines in a 
synchronous reference system necessarily intersect one another.t l 

We mentioned earlier that for dustlike matter the synchronous reference system can also 
be comoving. In this case the density of the matter goes to infinity at the caustic—simply 
as a result of the intersection of the world trajectories of the particles, which coincide with 
the time lines. It is, however, clear that this singularity of the density can be eliminated by 
introducing an arbitrarily small nonzero pressure of the matter, and in this sense is not 
physical in character. 


PROBLEMS 
1. Find the form of the solution of the gravitational field equations in vacuum in the vicinity of a 
point that is not singular, but regular in the time. 


Solution: Having agreed on the convention that the time under consideration is the time origin, 
we look for yas in the form: 


Jas = Quetthastt Cast n (1) 


where Gas, bas, Cas are functions of the space coordinates. In this same approximation the reciprocal 
tensor is: 


y? = a*^ — rtb*? + t 2(b*sb5 — c°”), 


where a*? is the tensor reciprocal to aas, and the raising of indices of the other tensors is done by 
using a°’, We also have: 


Xag = bag + 2tCag, x1 = bf t(2cl — bpb”). 
The Einstein equations (97.11-13) lead to the following relations: 


R? = —c +1bfb5=0, (2) 
R? = b5, — bu) + r[- Cia + jb!) + chis + OED.» — Hoth). g) = 0, (3) 
Ro = —P! —}b8b+4b7 bf — cf =0 (4) 


(b = bt, c = c$). The operations of covariant differentiation are carried out in the three-dimensional 
space with metric aas; the tensor Pas is also defined with respect to this metric. 

From (4) the coefficients cas are completely determined in terms of the coefficients aus and bas. 
Then (2) gives the relation 


P+ 4b?—43b' 63 =0. (5) 
From the terms of zero order in (3) we have: 
bE. a = bea. (6) 


The terms ~¢ in this equation vanish identically when we use (2) and (4)-(6) and the identity 
P5 ,— iP., [see (92.10)). 

Thus the twelve quantities aas, bas are related to one another by the one relation (5) and the 
three relations (6), so that there remain eight arbitrary functions of the three space coordinates. 
Of these, three are related to the possibility of arbitrary transformations of the three space co- 


t For the analytic construction of the metric in the vicinity of a fictitious singularity in a synchronous 
reference system, see E. M. Lifshitz, V. V. Sudakov and I. M. Khalatnikov, JETP 40, 1847, 1961, (Soviet 
Phys.—JETP, 13, 1298, 1961). The general character of the metric is clear from geometrical considerations. 
Since the caustic hypersurface always contains timelike intervals (the line elements of the geodesic time 
lines at their points of tangency to the caustic), it is not spacelike. Furthermore, on the caustic one of the 
principal values of the metric tensor y,, vanishes, corresponding to the vanishing of the distance (6) between 
two neighboring geodesics that intersect one another at their point of tangency to the caustic. The quantity 
Ó goes to zero as the first power of the distance (/) to the point of intersection. Thus the principal value of 
the metric tensor, and with it the determinant y, goes to zero like /?. 
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ordinates, and one to the arbitrariness in choosing the initial hypersurface for setting up the 
synchronous reference system. Therefore we are left with the correct number (see the end of $ 95) 
of four “physically different" arbitrary functions. 
2. Calculate the components of the curvature tensor Ry, in the synchronous reference system. 
Solution: Using the Christoffel symbols (97.9) we find from (92.1): 


Resys = —Pasyot 1Otao X5» — Xay 55), 
Roasy = (xay: 8— Xap: r) 

1 , 
Roaos = 2 et Xag — 4 Xay Xs, 


where P.;;,, is the three-dimensional curvature tensor corresponding to the three-dimensional 
metric yas. 

3. Find the general form of the infinitesimal transformation from one synchronous reference 
system to another. 

Solution: The transformation has the form 


ttt olx, x?, x), x* x^ a(x! x?, X5, t), 


where v and ¢° are small quantities. We are guaranteed that the condition goo = | is satisfied by 
keeping e independent of ¢; to maintain the condition goa = 0, we must satisfy the equations 


. ag? 09 
LER 
‘from which 
a — 0e as af yl 2 3 
AE dt t f *(x!, x?, x9), (1) 


where the f° are again small quantities (forming a three-dimensional vector f). The spatial metric 
tensor Yas is replaced by . . 
Yap > Yap = Sa; BT op. [ 2 9X,g (2) 


{as can be easily verified using (94.3)). 
Thus the transformation contains four arbitrary functions (o, f°) of the space coordinates. 


§ 98. The tetrad representation of the Einstein equations 


The determination of the components of the Ricci tensor (and the resulting formulation 
of the Einstein equations) for a metric of some special form generally involves quite compli- 
cated calculations. It is therefore important to consider various formulas that enable one 
to simplify these calculations in some cases and to represent the result in a more easily 
understood form. Among such formulas is the expression for the curvature tensor in 
tetrad form. 

We introduce a set of four linearly independent coordinate four-vectors ej, (labeled by 
the index a) and subject only to the condition that 


€i = Mad» (98.1) 


where na is a given constant symmetric matrix with signature + — — — ; we denote the 
matrix reciprocal to na, by n**(n"*n,, = ó5).t Together with the tetrad of vectors eie, we 


+ In this section we use early Latin letters a, b, c, . . . for indices labeling coordinate vectors; four-tensor 
indices are again labeled by i, k,l... . In the literature the indices of axes are usually enclosed in brackets. 
To avoid too cumbersome writing of formulas we shall use brackets only when indices for axes appear along 
with tensor indices and omit them for quantities which by their definition involve only coordinate axis 
indices (for example, »,, and later y,,., 4,,.). A summation is understood throughout over repeated indices 
of both types. 
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also introduce a tetrad of reciprocal vectors e'^" (labeled by superscript axial indices), 
and defined by the conditions 


eet, = ds, (98.2) 


i.e., each of the vectors eí^' is orthogonal to the three vectors ef, with 6a. Multiplyin 
i g (b) ply g 


(98.2) by ek. we get (efel ein = et. from which we see that in addition to (98.2), the 
equation 
eie, dF (98.3) 


is also automatically satisfied. 
Multiplying the equation e(,)@,.); = Nae on both sides by n°, we get: 


ei" e) = oa: 
comparing with (98.2), we find that 
ei? = ei. ey = Moet. (98.4) 


Thus the lowering and raising of tetrad indices is done with the matrices n,- and n°. The 
importance of the tetrads thus introduced is that one can express the metric tensor in 
terms of them. In fact, according to the definition of the relation between co- and contra- 
variant components of a four-vector, we have ej^ = gj,e'^': multiplying by eax and 
using (98.3) and (98.4), we find: 


Qu = Carer” = Navel ey. (98.5) 
The square of the line element with the metric tensor (98.5) is 
ds? = nalei dx'YejPdx^). (98.6) 


As for the arbitrarily assignable matrix 4,,, the most natural choice is the "galilean" 
form (i.e. a diagonal matrix with elements 1, —1, —1, —1); then, according to (98.1) the 
coordinate vectors will be orthogonal, with one of them timelike and the other three 
spacelike.t We emphasize, however, that this choice is by no means necessary, and that 
situations are possible where, for one or another reason (e.g., symmetry properties of the 
metric) the choice of a nonorthogonal tetrad is advantageous.? 

The tetrad components of a four-vector 4‘ (and similarly for a four-tensor of any rank) 
are defined as its “projections” on the coordinate four-vectors: 


Aq) = ea) Ar, AO = eA! = n" Aq. (98.7) 


Conversely, 
A; = ei? AG. A = euyAC, (98.8) 


+ Having chosen the linear forms dx‘) = e'" dx' as the segments of the coordinate axes in the given 
element of four-space (and taken "galilean" »,,) we bring the metric in this element to galilean form. 
We emphasize once again that the forms dx'^ are not in general exact differentials of any function of the 
coordinates. 

1 The advantageous choice of the tetrad may be dictated by a previous reduction of ds? to the form (98.6) 
Thus the expression for ds? in the form (88.13) corresponds to coordinate vectors 


e9 = (Vh, —Vhg) &=(, e“), 
where the choice of the e'*' depends on the spatial form d/?. 
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In this same way we define the operation of "differentiation along the a direction”: 


a 
. 6 
ó (a) = Ca) 20 
. ox 
We introduce quantities needed in the sequel:t 
_ i ok 
Yach F Ccariske my tes (98.9) 
and their linear combinations 
fabe = Yabe — Yaco = 
Ly ik i k 
= (uaa 7 Cares Eae) = (Eraik — Eca, i)e Eer (98.10) 


The last equation in (98.10) follows from (86.12); we note that the quantities 4,,. are 
calculated by a simple differentiation of the frame vectors. The converse expression for the 
yay. in terms of the Zape is: 


Yabe = lac + Anca = / cab): (98. 11 ) 


These quantities have symmetry properties: 


^ ^ 


Yabe = —Voacs “abe = T Aach (98.12) 


Our aim is to determine the tetrad components of the curvature tensor. To do this we 
must start from the definition (91.6) as applied to the covariant derivatives of the frame 
vectors: 

taii ^ O(a yiztsk = (a) R mikt» 
or 
i k 4l 
Riannroa = oig ~ C iaa eioan 
This expression is easily written in terms of the quantities y,,.. We write 


= (5) fc) 
É(ayisk = Vabcli ex , 
and after further covariant differentiation the derivatives of the frame vectors are again 
expressed in this same way; here the covariant derivative of the scalar quantity y,,, coincides 
with its ordinary derivative.t As a result we get 


Ryo = Yared T abd, + 
+ Ya ca = Ya) + YareY ba 7 Yara?” bes (98.13) 
where, in accordance with the general rule, y*,, = N% Yaso, etc. 


Contraction of this tensor on a pair of indices a, c gives the required tetrad components 
of the Ricci tensor; we present them as expressed in terms of the quantities Aase: 


E € 5 € c ac 
Raney = — Hia ot Aba PLE cab t ^ cb,a t 


AP A aa A aca VÀ aca A aA A Aus. (98.14) 


t The quantities y,,. are called the Ricci rotation coefficients. 
t We give for reference the transformation in similar fashion of arbitrary four-vectors and four-tensors: 


1 a E 2 (d 
A, Ma) €, = A A Vaan? 


(CELE 


k e m — gid) (a) 
Aina Siar € Ser = Aarono T A io Yaae T ai Yave’ 


etc. 
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Finally, we call attention to the fact that our construction is essentially independent 
of the four-dimensional nature of the metric. Thus the results obtained can also be used 
to calculate the three-dimensional Riemann and Ricci tensors for a three-dimensional 
metric. Naturally, then, in place of the tetrad of four-vectors we will deal with a triad of 
three-vectors, while the matrix a should have the signature + + + (we shall deal with 
such an application in § 116). 


CHAPTER 12 


THE FIELD OF GRAVITATING BODIES 


§ 99. Newton’s law 


In the Einstein field equations we now carry out the transition to the limit of non- 
relativistic mechanics. As was stated in § 87, the assumption of small velocities of all particles 
requires also that the gravitational field be weak. 

The expression for the component goo of the metric tensor (the only one which we need) 
was found, for the limiting case which we are considering, in § 87: 


2¢ 
Joo = 1+ "E 


Further, we can use for the components of the energy-momentum tensor the expression 
(35.4) T* = uc?^u;u*, where p is the mass density of the body (the sum of the rest masses 
of the particles in a unit volume; we drop the subscript 0 on y). As for the four-velocity u', 
since the macroscopic motion is also considered to be slow, we must neglect all its space 
components and retain only the time component, that is, we must set u* = 0, u? — us =l. 
Of all the components TY, there thus remains only * 


T} = uc?. (99.1) 


The scalar T = Ti will be equal to this same value pc?. 
We write the field equations in the form (95.8): 


8 1 
Rt = zk (Ti-38T); 
fori=k=0 
4nk 


One easily verifies that in the approximation we are considering all the other equations vanish 
identically. 

For the calculation of R9 from the general formula (92.7), we note that terms containing 
products of the quantities T$, are in every case quantities of the second order. Terms con- 
taining derivatives with respect to x? = ct are small (compared with terms with derivatives 
with respect to the coordinates x^) since they contain extra powers of I/c. As a result, there 
remains Roo R$ = OL'$o/0x*. Substituting 


295 
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1 2go 1 d$ 
re ~ 2l gan 9900 | 
007 748 ant ct ext 


we find 


Thus the Einstein equations give 
Ad = 4nky. (99.2) 
This is the equation of the gravitational field in nonrelativistic mechanics. It is completely 
analogous to the Poisson equation (36.4) for the electric potential, where here in place of 
the charge density we have the mass density multiplied by —k. Therefore we can immediately 
write the general solution of equation (99.2) by analogy with (36.8) in the form 


- -e fo (99.3) 


This formula determines the potential of the gravitational field of an arbitrary mass distribu- 
tion in the nonrelativistic approximation. 
In particular, we have for the potential of the field of a single particle of mass m 


$--X (99.4) 


and, consequently, the force F = —m'(0$/OR), acting in this field on another particle 
(mass m^), 1s equal to 
kmm' 
E 
This is the well-known /aw of attraction of Newton. 
The potential energy of a particle in a gravitational field is equal to its mass multiplied by 
the potential of the field, in analogy to the fact that the potential energy in an electric field is 
equal to the product of the charge and the potential of the field. Therefore, we may write, by 
analogy with (37.1), for the potential energy of an arbitrary mass distribution, the expression 


U= ; f ud dV. (99.6) 


For the Newtonian potential of a constant gravitational field at large distances from the 
masses producing it, we can give an expansion analogous to that obtained in §§ 40—41 for 
the electrostatic field. We choose the coordinate origin at the inertial center of the masses. 
Then the integral f ur dV, which is analogous to the dipole moment of a system of charges, 
vanishes identically. Thus, unlike the case of the electrostatic field, in the case of the gravita- 
tional field we can always eliminate the “dipole terms". Consequently, the expansion of the 
potential ¢ has the form: 


F-- (99.5) 


- kit ge á = 99.7 
PN hg MOX. R (94) 

where M = f u dV is the total mass of the system, and the quantity 
Dap = f u(3x, xs — r28,5) dV (99.8) 


may be called the mass quadrupole moment tensor. It is related to the usual moment of 


t We here write all indices a, f as subscripts, not distinguishing between co- and contravariant components, 
in accordance with the fact that all operations are carried out in ordinary Newtonian (Euclidean) space. 
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inertia tensor 


i= [ pe(r?5gy—XeXp) dV 
by the obvious relation 
Dap = J,,0,5 — 39,5. (99.9) 


The determination of the Newtonian potential from a given distribution of masses is the 
subject of one of the branches of mathematical physics; the exposition of the various 
methods for this is not the subject of the present book. Here we shall for reference purposes 
give only the formulas for the potential of the gravitational field produced by a homo- 
geneous ellipsoidal body. 

Let the surface of the ellipsoid be given by the equation 


254. sb (99.10) 


Then the potential of the field at an arbitrary point outside the body is given by the following 
formula: 


f y z? Nds 
= — l — 99.11 
ROS Í (1-3 M oe -z4) R, ( 
t 


R, = V (a? o s(b? & sc? +5), 
where č is the positive root of the equation 


x? y? z 
——lc YH FZ = (99.12 
até bal cë ) 


The potential of the field in the interior of the ellipsoid is given by the formula 


eo 


x? y? z? N ds 

m WIESE RIVE ME eV 99.13 

? ruabek | ( a?+s b?’+s Fak ( 
(U 


which differs from (99.11) in having the lower limit replaced by zero; we note that this expres- 
sion is a quadratic function of the coordinates x, y, Z. 

The gravitational energy of the body is obtained, according to (99.6), by integrating the 
expression (99.13) over the volume of the ellipsoid. This integral can be done by elementary 
methods,T and gives: 


U 3km? fi a? b? £) |$- 
Eg sla Das! ots R, 


_ 3km? f (s IN 2 =] 
«fg S" RJ SR, 
o 


(m -7 abc pu is the total mass of the body); integrating the first term by parts, we 


t The integration of the squares x?, 5?, z? is most simply done by making the substitution x = ax’, 
y = by’, z = cz', which reduces the integral over the volume of the ellipsoid to an integral over the volume of 
the unit sphere. 
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obtain finally: 


œ 


rur ds 
DR 


(99.14) 


All the integrals appearing in formulas TT can be expressed in terms of 
elliptic integrals of the first and second kind. For ellipsoids of rotation, these integrals are 
expressed in terms of elementary functions. In particular, the gravitational energy of an 
oblate ellipsoid of rotation (a = b > c) is 


3km? c 
U = — —— cos"! - (99.15) 
5 Va? —c? 6 
and for a prolate ellipsoid of rotation (a > b = c): 
3km? a 
U = — ———— cosh"! -. 99.16) 
5 Ja?—c? c ( 
For a sphere (a = c) both formulas give the value U = —3km?/5a, which, of course, can 


also be obtained by elementary methods.t 


PROBLEM 


Determine the equilibrium shape of a homogeneous gravitating mass of liquid which is rotating 
as a whole. 

Solution: The condition of equilibrium is the constancy on the surface of the body of the sum of 
the gravitational potential and the potential of the centrifugal forces: 


o? 
$— z (x? y?) = const. 


(Q is the angular velocity; the axis of rotation is the z axis). The required shape is that of an oblate 
ellipsoid of rotation. To determine its parameters we substitute (99.13) in the condition of equi- 
librium, and eliminate z? by using equation (99.10); this gives: 


AME NEN M. E. 
QT y?) | oven 2nukaà?c a? exse = const., 
0 0 


from which it follows that the expression in the square brackets must vanish. Performing the in- 
tegration, we get the equation 

(a? + 2c?)c - ;€ 3c? o? 25 4n 1/3 M?y 1/3 413 

(a? — c2 a -è inku 6\3 mk 
(M = £ma?Q. is the angular momentum of the body around the z axis), gem determines the 
ratio of the semiaxes c/a for given Q or M. The dependence of the ratio c/a on M is single-valued; 
c/a increases monotonically with increasing M. 

However, it turns out that the symmetrical form which we have found is stable (with respect to 
small perturbations) only for not too large values of M.t The stability is lost for M = 
2.89 k1/? mb? , 1/6 (when c/a = 0.58). With further increase of M, the equilibrium shape becomes a 
general ellipsoid with gradually decreasing values of b/a and c/a (from 1 and from 0.58, respectively). 
This shape in turn becomes unstable for M = 3.84 k!? m5’? 4171/6 (when a: b: c = 1:0.43:0.34). 


t The potential of the field inside a homogeneous sphere of radius a is 


2 
ọ = —2nzku (e-5). 


1 References to the literature concerning this question can be found in the book by H. Lamb, Hydro- 
dynamics, chap. XII. 
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§ 100. The centrally symmetric gravitational field 


_ Let us consider a gravitational field possessing central symmetry. Such a field can be 
produced by any centrally symmetric distribution of matter; for this, of course, not only the 
distribution but also the motion of the matter must be centrally symmetric, i.e. the velocity 
at each point must be directed along the radius. 

The central symmetry of the field means that the space-time metric, that is, the expression 
for thc interval ds, must be the same for all points located at the same distance from the 
center. In euclidean space this distance is equal to the radius vector; in a non-euclidean 
space, such as we have in the presence of a gravitational field, there is no quantity which has 
all the properties of the euclidean radius vector (for example to be equal both to the distance 
from the center and to the length of the circumference divided by 27). Therefore the choice 
of a “radius vector” is now arbitrary. 

If we use "spherical" space coordinates r, 0, œ, then the most general centrally symmetric 
expression for ds? is 


ds? = h(r, t) dr? +k(r, tY(sin? 0 d$? -- d0?) + I(r, t) dt? -- a(r, t) dr dt, (100.1) 


where a, h, k, lare certain functions of the "radius vector" r and the "time" t. But because 
of the arbitrariness in the choice of a reference system in the general theory of relativity, we 
can still subject the coordinates to any transformation which does not destroy the central 
symmetry of ds?; this means that we can transform the coordinates r and f according to the 
formulas 


r - fr, t), t =f2,(r', r), 


where /,, f; are any functions of the new coordinates r’, 1’. 

Making use of this possibility, we choose the coordinate r and the time ¢ in such a way 
that, first of all, the coefficient a(r, t ) of dr dt in the expression for ds? vanishes and, secondly, 
the coefficient k(r, t) becomes equal simply to —r?.t The latter condition implies that the 
radius vector r is defined in such a way that the circumference of a circle with center at the 
origin of coordinates is equal to 2zr (the element of arc of a circle in the plane 0 = 2/2 
is equal to dl = r d$). It will be convenient to write the quantities A and / in exponential 
form, as — e? and c?e" respectively, where A and v are some functions of r and t. Thus we 
obtain the following expression for ds?: 


ds? = e'c? di! — r(d0? sin? 0 d$?) — et dr?. (100.2) 

Denoting by x°, x', x?, x?, respectively, the coordinates ct, r, 0, $, we have for the 

nonzero components of the metric tensor the expressions 
Joo =e’, gau = —e’, ga = r°, 933 = -r° sin? 0. 
Clearly, 
g =e", g!! = —e7?^, g? = -r g= —rp sin ^? 0. 

With these values it is easy to calculate the Ij, from formula (86.3). The calculation leads 
to the following expressions (the prime means differentiation with respect to r, while a dot 
on a symbol means differentiation with respect to ct): 


t These conditions do not determine the choice of the time coordinate uniquely. It can still be subjected 
to an arbitrary transformation t = f(t’), not containing r. 
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ri -5, Tto 7 > I2, = —sin 0 cos 0, 
A vy .- 
Ti =- e”, ri = —re~’, Too = 2 e ^ 
1 " (100.3) 
Ii-2Ii--, T}, 2 cot 0, I9, = -. 
r 2 
À 
Tio=5> Ti, = —rsin? Qe^?. 


All other components (except for those which differ from the ones we have written by a 
transposition of the indices k and /) are zero. 

To get the equations of gravitation we must calculate the components of the tensor R; 
according to formula (92.7). A simple calculation leads to the following equations: 


dis ri- e (Le 5) 45, (100.4) 

= Ti -= Ti = -jeah tt eee ) «ye : - 2) (100.5) 
ink To- -e(-e (100.6) 

s Ti- -ed (100.7) 


The other components of (95.6) vanish identically. Using (94.9). the components of the 
energy momentum tensor can be expressed in terms of the energy density e of the matter, 
its pressure p, and the radial velocity v. 

The equations (100.4-7) can be integrated exactly in the very important case of a centrally 
symmetric field in vacuum, that is, outside of the masses producing the field. Setting the 
energy-momentum tensor equal to zero, we get the following equations: 


"ay 4 
ems JEERIS (100.8) 
r r r 
A! 1 1 
-À ll. A = 
e E a)+ 4, =0, (100.9) 
Àz0 (100.10) 


[we do not write the fourth equation, that is, equation (100.5), since it follows from the 
other three equations]. 

From (100.10) we see directly that 2 does not depend on the time. Further, adding 
equations (100.8) and (100.9), we find A’+v’ = 0, that is, 


À v = f(t), (100.11) 


where f(t) is a function only of the time. But when we chose the interval ds? in the form 
(100.2), there still remained the possibility of an arbitrary transformation of the time of the 
form ¢ = f(t’). Such a transformation is equivalent to adding to v an arbitrary function of 
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the time. and with its aid we can always make f(r) in (100.11) vanish. And so, without any 
loss in generality, we can set ¿+v = 0. Note that the centrally symmetric gravitational 
field in vacuum is automatically static. 

The equation (100.9) is easily integrated and gives: 


» const 
eta=e alt : 


(100.12) 


r 


Thus, at infinity (r > oo), e^ ^ = e" = 1, that is, far from the gravitating bodies the metric 
automatically becomes galilean. The constant is easily expressed in terms of the mass of the 
body by requiring that at large distances, where the field is weak, Newton's law should 
hold.t In other words, we should have goo = 1+(2¢/c?), where the potential ¢ has its 
Newtonian value (99.4) @ = —(kmjr) (m is the total mass of the bodies producing the field). 
From this it is clear that const = — (2km/c?). This quantity has the dimensions of length; 
it is called the gravitational radius r, of the body: 


2km 
9 A 


(100.13) 


Thus we finally obtain the space-time metric in the form: 


dr? 


ds? = (: -^ c? di? — r*(sin? 0 d$? +d0?)— (100.14) 
" 


pas 

* 
This solution of the Einstein equations was found by K. Schwarzschild (1916). It completely 
determines the gravitational field in vacuum produced by any centrally-symmetric distribu- 
tion of masses. We emphasize that this solution is valid not only for masses at rest, but also 
when they are moving, so long as the motion has the required symmetry (for example, a 
centrally-symmetric pulsation). We note that the metric (100.14) depends only on the total 
mass of the gravitating body, just as in the analogous problem in Newtonian theory. 

The spatial metric is determined by the expression for the element of spatial distance: 
dr? 


d? = 


4- r*(sin? 0 d$? + d0’). (100.15) 


1- -4 
r 


The geometrical meaning of the coordinate r is determined by the fact that in the metric 
(100.15) the circumference of a circle with its center at the- center of the field is 2zr. But the 
distance between two points r, and r; along the same radius is given by the integral 


r1 
Í dr 
5 a led 
r 
Furthermore, we see that goo < 1. Combining with the formula (84.1) dt = V doo dt, 
defining the proper time, it follows that 


d: < dt. (100.17) 


»n-n. (100.16) 


+ For the field in the interior of a spherical cavity in a centrally symmetric distribution, we must have 
const = 0, since otherwise the metric would have a singularity at r = 0. Thus the metric inside such a cavity 
is automatically galilean, i.e., there is no gravitational field in the interior of the cavity (just as in Newtonian 
theory). 
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The equality sign holds only at infinity, where t coincides with the proper time. Thus at 
finite distances from the masses there is a "slowing down" of the time compared with the 
time at infinity. 

Finally, we present an approximate expression for ds? at large distances from the origin 
of coordinates: 


2k 
ds? = ds? — — Gr +e? di^). (100.18) 


The second term represents a small correction to the galilean metric dsg. At large distances 
from the masses producing it, every field appears centrally symmetric. Therefore (100.18) 
determines the metric at large distances from any system of bodies. 

Certain general considerations can also be made concerning the behavior of a centrally 
symmetric gravitational field in the interior of the gravitating masses. From equation (100.6) 
we see that for r+0, 2 must also vanish at least like r?; if this were not so the right side 
of the equation would become infinite for r0, that is, T9 would have a singular point at 
r = 0, which is physically impossible. Formally integrating (100.6) with the limiting con- 
dition 4|, 9 = 0, we obtain 


8zk 
i= -mdi m T?r? ar}. (100.19) 
ctr 
Since, from (100.10), 79 = e`"Too>0, it is clear that A>0, that is, 
e^z 1. (100.20) 
Subtracting equation (100.6) term by term from (100.4), we get: 
v? 
LE 
-1 Bak e+p ( ;) 
tuam (opa 64 s 
r c v 
m 


ie. v'4-À z 0. But for r= œ (far from the masses) the metric becomes galilean, i.e. 
y 2.0, 4 — 0. Therefore, from v'4- 4' > 0 it follows that over all space 


v+4 <0. (100.21) 
Since 4 > 0, it then follows that v < 0, i.e. 
e' « 1. (100.22) 


The inequalities obtained show that the above properties (100.16) and (100.17) of the 
spatial metric and the behaviour of clocks in a centrally symmetric field in vacuum apply 
equally well to the field in the interior of the gravitating masses. 

If the gravitational field is produced by a spherical body of "radius" a, then for r>a, 
we have T$ = 0. For points with r>a, formula (100.19) therefore gives 


a 
82k co 
= —ln {1- ED f Tor ar}. 
0 


On the other hand, we can here apply the expression (100.14) referring to vacuum. according 
to which 
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Equating the two expressions, we get the formula 
mz Tor? dr, (100.23) 


expressing the total mass of a body in terms of its energy-momentum tensor. 
In particular, for a static distribution of matter in the body we have T? = e, so that 


m = — | er°dr. (100.24) 


We call attention to the fact that the integration is taken with respect to 4zr^dr, whereas 
the element of spatial volume for the metric (100.2) is dV = 4nr?e^"? dr, where, according 
to (100.20), &^ ?» 1. This difference indicates the gravitational mass defect of the body. 


PROBLEMS 


1. Find the invariants of the curvature tensor for the Schwarzschild metric (100.14). 

Solution: A calculation with (92.1) and the T,, from (100.3) (or using the formulas found in 
problem 2 of 3 92) leads to the following values for the nonzero components of the curvature 
tensor: 


R alg R Roso3 r(r—r,) 
0 mt 020 TE ADAE FII 
0101 p 202 sin? 0 2r? 3 
Risia r : 
Riai2 = = : R323 = Wt, sin? 6. 


sin?@ | 2(r—r,) 


For the invariants Z, and 7; of (92.20) we find: 


r; 2 Fy \2 
TORRO 


(products including the dual tensor R,,,,, are identically zero). The curvature tensor belongs to 
type D of the Petrov classification (with real invariants 4") = 2? = —r,/[2r?). We note that the 
curvature invariants have a singularity only at the point r = 0, and not at r — r,. 

2. Determine the spatial curvature for this metric. 

Solution: The components of the spatial curvature tensor Pasys can be expressed in terms of the 

components of the tensor Pes (and the tensor 7.5) so that we need only calculate Pas (see problem 1 
in 8 92). The tensor Pes is expressed in terms of yas just as Rix is expressed in terms of gix. Using 
the values of y«s from (100.15), we find from the calculations: 
Fg 
273" 
and P! — 0 for a + f. We note that P$, P? 0, P; < 0, while P =P! —0. 

From the formula given in problem 1 of 8 92, we find: 

Prere = (PHP a)r Yoo = ERPS Yrr Yoo, 

Poro = —P$ yn Yos 

Posso = —P; Yoo Yoo. 
It then follows (see the footnote on p. 264) that for a “plane” perpendicular to the radius, the 
Gaussian curvature is 


P{=P3= 


P, 
= = P>, 
Yoo Yoo 
(which means that, in a small triangle drawn on the “plane” in the neighborhood of its intersection 


with the radius perpendicular to it, the sum of the angles of the triangle is greater than 7). As to 
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the "planes" which pass through the centre, their Gaussian curvature K<0; this means that.the 
sum of the angles of a small triangle in such a "plane" is less than z (however this does not refer 
to the triangles embracing the centre—the sum of the angles in such a triangle is greater than z). 
3. Determine the form of the surface of rotation on which the geometry would be the same as on 
a "plane" passing through the origin in a centrally symmetric gravitational field in vacuo. 
Solution: The geometry on the surface of rotation z — z(r) is determined (in cylindrical co- 


ordinates) by the element of length: 
dl? = dr? + dz? r?dg? = dr?(1 + 2?) + r?dg?. 
Comparing with the element of length (100.15) in the "plane" 6 = 2/2 
2 — pdo? dr? 
dl? —r?dg?-- —- , 
1 — rjr 


we find 


from which m 
z=2Vr,(r—r,). 


For r = r, this function has a singularity—a branch point. The reason for this is that the spatial 
metric (100.15) in contrast to the space-time metric (100.14), actually has a singularity at r = r,. 

The general properties of the geometry on "planes" passing through the center, which were 
mentioned in the preceding problem, can also be found by considering the curvature in the pictorial 
model given here. 

4. Transform the interval (100.14) to such coordinates that its element of spatial distance has 
conformal—euclidean form, i.e. d/? is proportional to its euclidean expression. 


Solution: Setting 
ER 1 i Ts ; 
r 4 p, 


we get from (100.14) 


a 
4 
ds? = x c di?— ( -} &) (dp? 4- p? dU? =- p? sin? 0 de?). 
I4 
4p 


The coordinates p, 0, 9 are called isotropic spherical coordinates; instead of them we can also 
introduce isotropic cartesian coordinates x, y, z. In particular, at large distances (p > r,) we have 
approximately: 


ds? = ( - 2) cd? — ( + r) (dx? + dy? + dz?) 
p p 

5. Find the equations for a centrally symmetric gravitational field in matter in the comoving 
reference system. 

Solution: We make use of the two possible transformations of the coordinates r, ! in the element 
of interval (100.1) in order to, first, make the coefficient alr, 1) of dr dt vanish, and, second, to make 
the radial velocity of the matter vanish at each point (because of the central symmetry the other 
components are not present). After this is done, r and ¢ can still be subjected to an arbitrary trans- 
formation of the form r = r(r’) and t = i(t’). 

We denote the radial coordinate and time selected in this way by R and r, and the coefficients 
h, k, L by —e^, —e", e”, respectively (where 4, u and v are functions of R and zt.) We then have 
for the line element: 

ds? = c'e*dv? —e^dR? —e*(d6? -sin? 0 d$?). (1) 


In the comoving reference system the components of the energy-momentum tensor are: 


Te =e, T =T} =T} = p. 


0 
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The calculation gives the following field equations:t 


7 T! = ot p = łem? (5 tuv eigi +B) e (2) 
— ae m p = fer A2” +w? 24" ru? — pW —v 9h) 
dec Qs + oi Aju —2À —2? —24 h’), (3) 
~ T? = x e eA (wr ne (4i +e te (4) 
UM Ty = 0 = je Hin Ju vi) (5) 


(where the prime denotes differentiation with respect to R and the dot with respect to cr). 

General relations for 2, & and v can be found easily if we start from the equations T. =0 
which are contained in the field equations. Using formula (86.11), we get the following two equa- 
tions: 


; A 2€ 2p’ 
À424 = -—, v=- 6 
+24 pre » ree (6) 
If p is known as a function of c, equation (6) can be integrated in the form: 
de dp 
A+2u = —2 [onm v= EAT (7) 


where the functions f, (R) and f(t) can be chosen arbitrarily in view of the possibility mentioned 
above of making arbitrary transformations of the form R = R(R’), t = t(t’). 

6. Find the equations determining the static gravitational field in vacuum around an axially 
symmetric body at rest (H. Weyl, 1917). 

Solution: The static line element in cylindrical space coordinates x! = $, x? — o, x? =z is 
assumed to have the form 


ds? = e*c!dr! —e® dj? —e"(de? +dz?), 


where v, w, u are functions of o and z; such a representation fixes the choice of coordinates to 
within a transformation of the form e = o(o', z^), z = z(e’, z^), which multiplies the quadratic 
form de? +dz? simply by a common factor. 

From the equations 


R$ = de^" Qv, v unu ou) 2v... v (v, 0,)] = 0, 
Ri = few pp tou, +o p)t+2w,,, to, (v, +w,,)] = 0 


(where the subscripts ,o and ,z denote differentiation with respect to o and z), and taking their 
sum, we find: 

Cpp Te. = 0, 
where 


vtw 


o(e,z) =e 7. 


Thus e'(o, z) is a harmonic function of the variables o, z. According to the well-known properties 
of such functions, this means that there exists a conjugate harmonic function z'(o, z) such that 
oe’ +iz’ = f(p-- iz), where f is an analytic function of the complex variable o + iz. If now we choose 
£^, z’ as new coordinates, because of the conformality of the transformation o, z—0'z' we will have 


e"(do? +dz*) = e"'(do'? +dz’?), 


+ The components R,, can be calculated directly as was done in the text or with formulas obtained in 
problem 2 of § 92. 
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where 4'(o', z') is some new function. At the same time e” =: o^?e-"; writing e +r = y and dropping 
the primes, we write ds? in the form : 


ds? = e*c?dt? - -ge-* dd? --e?~ "(do? t dz?), (1) 
Forming the equations R} = 0, R3— R2 = 0, R? = 0 for this metric, we find: 
10 QrN er 
TIGRE e 
ôy Ov Gy ôy rN? SÕN? a 
> = - = =0 x -| > . 
ó: "0er ðe * | à.) (z) | ) 


We note that (2) has the form of the Laplace equation in cylindrical coordinates (for a function 
independent of $). If this equation is solved, then the function (o, z) is completely determined by 
eqs. (2)-(3). At large distances from the body producing the field, the functions » and y should 
tend to zero. 


§ 101. Motion in a centrally symmetric gravitational field 


Let us consider the motion of a particle in a centrally symmetric gravitational field. As 
in every centrally symmetric field, the motion occurs in a single “plane” passing through 
the origin; we choose this plane as the plane 0 = 2/2. 

To determine the trajectory of the particle, we use the Hamilton-Jacobi equation: 


where m is the mass of the particle (and we denote the mass of the central body by m"). Using 
the g^ given in the expression (100.14), we find the following equation: 


rA faS\? rA/0SV lf/0SV 
pe) (99 Y (or SY -LBY me = 0, 101.1) 
6-2) GS) C22) (3) ^ cou 


where r, = 2km'/c? is the gravitational radius of the central body. 
By the general procedure for solving the Hamilton-Jacobi equation, we look for an 
S in the form 
S = -t+ Méó  S,(r), (101.2) 


with constant energy ŝo and angular momentum M. Substituting (101.2) in (101.1), we 
find the derivative dS, /dr, and thus: 


(101.3) 


The dependence r = r(t) is given (cf. Mechanics, § 47) by the equation 0S/éé) = const, 


from which 
P dr 
049 | 7—ÁATTISROT EAT Wu (101.4) 
zm &) -(r« M y -5) 
r mc? m?c?r? r 


The trajectory itself is determined by the equation 0S/@M = const, so that 
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s= | E? M? "XN 
ee Gees ee 


r 


(101.5) 


This integral reduces to an elliptic integral. 

For the motion of a planet in the field of attraction of the Sun, the relativistic theory 
leads to only an insignificant correction compared to Newton’s theory, since the velocities 
of the planets are very small compared to the velocity of light. Jn the integrand in the equation 
(101.5) for the trajectory, this corresponds to a small value for the ratio r,/r, where r, is the 
gravitational radius of the Sun.t 

To calculate the relativistic corrections to the trajectory, it is convenient to start from 
the expression (101.3) for the radial part of the action, before differentiation with respect 
to M. 

We make a transformation of the integration variable, writing 


r(r—-n)-r?, ie. r- Far, 


as a result of which the term with M? under the square root takes the form M?/r’. In the 
other terms we make an expansion in powers of r,/r’, and obtain to the required accuracy: 


l e"? 1 1 3 24:22 1/2 

S, = Í (ae m+ 5 + ; (2m°m'k +48'mr)— Fe (w- Y o) dr, (101.6) 
[4 E R 

where for brevity we have dropped the prime on r’ and introduced the non-relativistic 

energy 6” (without the rest energy). 

The correction terms in the coefficients of the first two terms under the square root have 
only the not particularly interesting effect of changing the relation between the energy and 
momentum of the particle and changing the parameters of its Newtonian orbit (ellipse). But 
the change in the coefficient of 1/r? leads to a more fundamental effect—to a systematic 
(secular) shift in the perihelion of the orbit. 

Since the trajectory is defined by the equation ¢+(¢S,/0M) = const, the change of 
the angle $ after one revolution of the planet in its orbit is 

ô 
Ag = — > AS, 
$ eM 
where AS, is the corresponding change in S,. Expanding S, in powers of the small correction 
to the coefficient of 1/r?, we get: 
2 
AS. = AS% 3m?cr; vAS,”” 
d d 4M ôM 

where AS) corresponds to the motion in the closed ellipse which is unshifted. Differentiating 
this relation with respect to M, and using the fact that 


a 
— i AS) = Ag) = 2n, 
am 4° $ n 


we find: 
3nm?c?r 6nk?m?^m'? 
SLE open UBL 
2M? cM? 


Ag = 2n+ 


f For the Sun, r, = 3 km; for Earth, r, = 0.9 cm. 
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The second term is the required angular displacement ó$ of the Newtonian ellipse during 
one revolution, i.e. the shift in the perihelion of the orbit. Expressing it in terms of the length 
a of the semimajor axis and the eccentricity e of the ellipse by means of the formula 


M? 
pmm? 7 0789 
we Obtain: 
6nkin' 
6¢ -—————— 101.7 
$ c'a(1— e?) ( ) 


Next we consider the path of a light ray in a centrally symmetric gravitational field. This 
path is determined by the eikonal equation (87.9) 


A 


Ox x* ^ 

which differs from the Hamilton-Jacobi equation only in having m set equal to zero. There- 
fore the trajectory of the ray can be obtained immediately from (101.5) by setting m = 0; at 
the same time, in place of the energy &) = —(05S/ót) of the particle we must write the 
frequency of the light, wọ = —(0y/8t). Also introducing in place of the constant M a 
constant o defined by o = cM/og, we get: 


b= [= (101.8) 


Pf h(i" 
ot r r 


If we neglect the relativistic corrections (r, > 0), this equation gives r = g/cos @, i.c. a 
straight line passing at a distance g from the origin. To study the relativistic corrections, 
we proceed in the same way as in the previous case. 

For the radial part of the eikonal we have [see (101.3)]: 


| ® r? "am 
v = n Í Jo o r(r-r) ar. 


Making the same transformations as were used to go from (101.3) to (101.6), we find: 


Expanding the integrand in powers of r,/r, we have: 


Jpn yrs e [de a yoy ten 
r r N r c 


[4 


where i/(9) corresponds to the classical straight ray. 
The total change in y, during the propagation of the light from some very large distance R 
to the point r = g nearest to the center and then back to the distance R is equal to 


-ı R 
Ay, = A) +2 “872 cosh™! =, 
c 0 
The corresponding change in the polar angle ¢ along the ray is obtained by differentiation 


1 Numerical values of the shifts determined from formula (101.7) for Mercury and Earth are equal, 
respectively, to 43.0" and 3.8" per century. 
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with respect to M = go/oc: 


dd OA, | QAO? 2r,R 

ôM 0M ov R? — g? 
Finally, going to the limit R — oo, and noting that the straight ray corresponds to Ag = z, 
we get: 


2 
Aó-nt mE 
e 


This means that under the influence of the field of attraction the light ray is bent: its 
trajectory is a curve which is concave toward the center (the ray is "attracted" toward the 
center), so that the angle between its two asymptotes differs from z by 


EC LUE (101.9 
ge eg’ a 
in other words, the ray of light, passing at a distance ọ from the center of the field, is 
deflected through an angle 6¢.t 


§ 102. Gravitational collapse of a spherical body 


In the Schwarzschild metric (101.14), goo goes to zero and g,, to infinity at r = r, (on the 
“Schwarzschild sphere"). This could give the basis for concluding that there must be a 
singularity of the space-time metric and that it is therefore impossible for bodies to exist 
that have a “radius” (for a given mass) that is less than the gravitational radius. Actually, 
however, this conclusion would be wrong. This is already evident from the fact that the 
determinant g = —r* sin? 0 has no singularity at r, = r, so that the condition g < 0 (82.3) 
is not violated. We shall see that in fact we are dealing simply with the impossibility of 
establishing a rigid reference system for r«r,. 

To make clear the true character of the space-time metric in this domain? we make a 
transformation of the coordinates of the form: 


e= ters [OE Ratt [LÁ (102.1) 
r 
1-4 (i- 9) 
Then 
ich 
d? =; 5 (c^dc? — f^4R?)- r2(d0? sin? 0 d9?). 


We eliminate the singularity at r = r, by choosing f(r) so that f(r,) = 1. If we set f(r) = v rr. 
then the new coordinate system will also be synchronous (g,, = 1). First choosing the upper 
sign in (102.1), we have: 


1-f?)dr Xy 
p-er = [E fe aed 
E 3 s 
ec 


t For a ray just skirting the edge of the Sun, d¢ = 1.75”. 
t The physical meaning of the Schwarzschild singularity was first explained by D. Finkelstein (1958) 
using a different transformation. The metric (102.3) was first found by Lemaitre (1938). 
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or 3 2/3 
hà ($e) p (102.2) 


(we set the integration constant, which depends on the time origin, equal to zero). The 


element of interval is: 
dR? 3 4/3 5 
ds? = c?dr? — Bea — ;-e] r2? (q0? + sin? 0 d$?). (102.3) 


3 
ES (R = ct) 


In these coordinates the singularity on the Schwarzschild sphere [to which there corres- 
ponds the equality 3 (R— ct) = r,] is absent. The coordinate R is everywhere spacelike, 
while zt is timelike. The metric (102.3) is nonstationary. As in every synchronous reference 
system, the time lines are geodesics. In other words, “test” particles at rest relative to the 
reference system are particles moving freely in the given field. 

To given values of r there correspond world lines R— ct — const (the sloping straight 
lines in Fig. 20). The world lines of particles at rest relative to the reference system are 
shown on this diagram as vertical lines; moving along these lines, after a finite interval of 


proper time the particles “fall in" to the center of the field (r = 0), which is the location of 
the true singularity of the metric. 

Let us consider the propagation of radial light signals. The equation ds? = O (for 
0, @ = const) gives for the derivative dt/dR along the ray: 


dt 1 P 
EIC t7 wx 73 = + P (102.4) 
(> (R-c1)) 
2r, 


the two signs corresponding to the two boundaries of the light "cone" with its vertex at 
the given world point. When r >r, (point a in Fig. 20) the slope of these boundaries 
satisfies |c dr/dR| < 1, so that the straight line r = const (along which c dt/dR = 1) falls 
inside the cone. But in the region r < r, (point a’) we have |c dt/dR| > 1, so that the line 
r = const, the world line of a particle at rest relative to the center of the field, lies outside 
the cone. Both boundaries of the cone intersect the line r = 0 at a finite distance, approach- 
ing it along a vertical. Since no causally related events can lie on the world line outside the 
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light cone, it follows that in the region r « r, no particles can be at rest. Here all interactions 
and signals propagate in the direction toward the center, reaching it after a finite interval 
of time r. 

Similarly, choosing the lower signs in (102.1) we would obtain an "expanding" reference 
system with a metric differing from (102.3) by a change of the sign of t. It corresponds to a 
space-time in which (in the region r < r,) again rest is impossible, but all signals propagate 
outward from the center. 

The results described here can be applied to the problem of the behavior of massive 
bodies in the general theory of relativity. 

The investigation of the relativistic conditions for equilibrium of a spherical body shows 
that for a body of sufficiently large mass, states of static equilibrium cannot exist.T It is 
clear that such a body must contract without limit (i.e. it must undergo "gravitational 
collapse" ).i 

In a reference system not attached to the body and galilean at infinity [metric (100.14)], 
the radius of the central body cannot be less than r,. This means that according to the clocks 
t of a distant observer the radius of the contracting body only approaches the gravitational 
radius asymptotically as t — oo. It is easy to find the limiting form of this dependence. 

A particle on the surface of the contracting body is at all times in the field of attraction of 
a constant mass m, the total mass of the body. As r > r, the gravitational force becomes very 
large; but the density of the body (and with it, the pressure) remains finite. Neglecting the 
pressure forces for this reason, we reduce the determination of the time dependence r = r(t ) 
of the radius of the body to a consideration of the free fall of a test particle in the field of 
the mass m. 

The function r(1) for fall in a Schwarzschild field is given by the integral (101.4), where 
for purely radial motion M = 0. Thus, if the fall starts at a “distance” ry from the center 


with zero velocity at some time fg, the energy of the particle is &y = me? 1 —r,/ro, and 
for the time ¢ for it to reach the "distance" r we have: 


To dr 
hoc E 
c(t— tg) = zd j LEE S 
ro 1- =} /--—- 
: r F Po 


This integral diverges like r, In (r—r,) for r2r,. Thus we find the asymptotic formula for 
the approach of r to r,: 


(102.5) 


r—r, = const e^ o), (102.6) 


Thus the final stage of approach of the collapsing body to the gravitational radius occurs 
according to an exponential law with a very small characteristic time ~r,/c. 

Although the rate of contraction as observed from outside goes to zero asymptotically, 
the velocity v of fall of the particles, as measured in their proper time, increases and appro- 
aches the velocity of light. In fact, according to the definition (88.10): 


2 911 (2) 
v = ———|—]. 
Joo V dt 
1 See Statistical Physics, 8 111. 


* The essential properties of this phenomenon were first explained by J. R. Oppenheimer and H. Snyder 
(1939). 
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Taking gı; and goo from (100.14) and dr/dt from (102.5), we find: 


U r 
1—-3- . (102.7) 
c rs 
1-2 
Fo 


The approach to the gravitational radius, which according to the clocks of the outside 
observer takes an infinite time, occupies only a finite interval of proper time (i.e. time in the 
reference system comoving with the body). This is already clear from the analysis given above, 
but one can also verify it directly by computing the proper time 7 as the invariant integral 


ct = | ds = c? 4r "i 
= = Goo dr Jii r 


Taking dr/dt for the falling particle from (102.5), we get for the proper time for fall from 


ro tor: r 
t-r = Le - z) "dr. (102.8) 
cJ\r Fo 


0 


This integral converges for r > r,. 

Having reached the gravitational radius (as measured by proper time), the body will con- 
tinue to contract, with all of its particles arriving at the center within a finite time; the 
moment of collapse of each portion of the matter into the center is a true singularity of the 
space-time metric. We do not, however, observe any of this process of collapse of the body 
within the Schwarzschild sphere. The moment when the surface of the body crosses this 
sphere corresponds to t = oo; we may say that the whole process of collapse within the 
Schwarzschild sphere occurs “after an infinite time" for the distant observer—an extreme 
example of the relativity of time. There are, of course, no logical contradictions in this 
picture. In complete accord with it is the statement above about the property of the contract- 
ing coordinate system: in this system no signals emerge from the Schwarzschild sphere. 
Particles or light rays may intersect this sphere (in the comoving reference system) only in 
one direction—toward the inside, and once crossing there, can no longer emerge. Such a 
"one-way valve" is called the event horizon. 

With respect to an external observer the contraction to the gravitational radius is 
accompanied by a “closing up" of the body. The time for propagation of signals sent from 
the body tends to infinity: for a light signal c dt = dr/(1— rj/r), the time for propagation 
from r to some rgr is given by the integral 


. dr - 
cAt pq = ro-r-r,ln 2 vs, (102.9) 
1" r—r, 
r 


r 


which [like the integral (102.5)] diverges for r—r,. Intervals of proper time on the surface 
of the body are shortened, as compared to intervals of time f for the distant observer, in 
the ratio 

VGoo = Nl nr; 
consequently, as r—r, all processes on the body appear to be "frozen" with respect to the 
external observer. 


§ 102 GRAVITATIONAL COLLAPSE OF A SPHERICAL BODY 313 


The frequency of a spectral line emitted by the body and received by a distant observer 
is reduced, but this is not only an effect of the gravitational red shift, but also the effect of 
the Doppler shift due to the motion of the source, which is falling toward the center along 
with the surface of the sphere. When the radius of the sphere is already close to r, (so that 
the velocity of fall is already close to the light velocity) this effect reduces the frequency 


by a factor 
ji P RS =i jı i 
c? c] 2 ci 


Under the influence of both effects, the observed frequency consequently goes to zero as 
rr, according to the law 


k 
w = const (1-2) (102.10) 


Thus, from the point of view of a distant observer, the gravitational collapse leads to the 
appearance of a “congealed? body which sends no signals into the surrounding space 
and interacts with the external world only through its static gravitational field. Such a 
structure is called a black hole or a collapsar. 

In conclusion we make one further remark of a methodological nature. We have seen 
that for the central field in vacuum the "system of the outside observer" that is inertial at 
infinity is not complete: there is no place in it for the world lines of particles moving inside 
the Schwarzschild sphere. The metric (102.3) is still applicable inside the Schwarzschild 
sphere, but this system too is not complete in a certain sense. Consider, in this system, a 
particle carrying out a radial motion in the direction away from the center. As t — oo its 
world line goes out to infinity, while for t > — oo it must approach asymptotically tor = r,, 
since, in this metric, within the Schwarzschild sphere motion can occur only along the direc- 
tion to the center. On the other hand, emergence of the particle from r = r, to any given 
point r » r, occurs within a finite interval of proper time. In terms of proper time the 
particle must approach the Schwarzschild sphere from inside before it can begin to move 
outside it; but this part of the history of the particle is not kept by the particular reference 
system.t 

We emphasize that this incompleteness arises only in a formal treatment of the metric 
of the field, where it is regarded as produced by a point mass. In a real physical problem, 
say the collapse of an extended body, this incompleteness does not occur: the solution 
obtained by matching the metric (102.3) with the solution inside of the matter, will, of course, 
be complete, and will describe the whole history of all possible motions of the particles. 
(The world lines of particles moving in the region r>r, in the direction toward the center 
necessarily begin from the surface of the sphere, even before its contraction to within the 
Schwarzschild sphere.) 


PROBLEMS 


I. Find the radii of circular orbits for a particle in the field of a black hole (S. A. Kaplan, 1949). 
Solution: The dependence r = r(t) for a particle moving in the Schwarzschild field is given by 


t The construction of a reference system that is not incomplete in this way is considered in problem 5 
of this section. 
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(101.4), or, in differential form: 
I dr 


| —r, r edt 


pr M? 17/2 
n» me (A ae] 


(m is the mass of the particle, r, = 2km’:c? is the gravitational radius of the central body with 
mass m’). The function U(r) plays the role of an “effective potential energy" in the sense that the 
condition &9.- U(r) determines the admissible range of the motion (in analogy to nonrelativistic 
theory). Figure 21 shows curves of U(r) for various values of the angular momentum M of the 
particle. 


! 
= g: [65 - Uto, (1) 
o 


where 


U(n/mc? 


— Á— M E 


Fic. 21. 


The radii of curvature of the orbits and the corresponding values of éo and M are determined 
by the extrema of the function U(r), where the minima correspond to stable, and the maxima to 
unstable orbits. Simultaneous solution of the equations U(r) = 6o, U'(r) = 0 gives: 


r M? 3m!c?ri 
[eiue 


r mcr? 


37A 
o = Mc J2 (1 -"), 
um r 


where the upper sign refers to stable, and the lower to unstable orbits. The stable orbit closest to 
the center has the parameters 


r=3r, M= V3 mcr,, o = 3 me?, 


The minimum radius for an unstable orbit is 3r,/2 and is reached in the limit M-+20, Eo —oc. 
Figure 22 shows the dependence of rir, on M/mcr,; the upper branch gives the radius for stable, 
and the lower for unstable orbits.t 

2. For motion in this same field determine the cross-section for gravitational capture of (a) non- 
relativistic, (b) ultrarelativistic, particles coming from infinity (Ya. B. Zel'dovich and I. D. Novikov, 
1964). 


t For comparison we recall that in a Newtonian field circular orbits would be possible (and stable) at 
any distance from the center (the radius being related to the angular momentum by the formula r = M?/km'm?. 
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20 


r/r 


[oi 


5 e 3 
M/mcrg 


Fic. 22. 


Solution: (a) For a nonrelativistic velocity ve (at infinity) the energy of the particle is &,z: mc". 
From Fig. 21 we see that the line &5— mc? lies above all the potential curves with angular momenta 
M < 2mcr,, i.e. all those with impact parameters @ < 2crj/v«. All particles with such values of o 
undergo gravitational capture: they reach the Schwarzschild sphere (asymptotically, as 7- » oc) 
and do not emerge again to infinity. The capture cross-section is 


2 
c 

a= Ánr2 (<) . 
Uo 


(b) In equation (1) of problem 1 the transition to the ultrarelativistic particle (or to a light ray) 
is achieved by the substitution mn — 0. Also introducing the impact parameter ọ = cM/&o, we get: 


| to TELE 
ECL 


r 


The limits of the radial motion (the turning points) are determined by the roots of the expression 
under the square root. They are plotted as functions of o in Fig. 23; the regions of possible motions 


I, 


We 
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correspond to the unshaded part of the plane. The curve has a minimum at the point 


3V3 3 
o= r r =r. 


9 2e 2^ 


For smaller values of the impact parameter the particle does not reach the turning point, i.e. it 
moves to the Schwarzschild sphere. Then we get for the capture cross-section 


o=— ar}. 


4 


§ 103. Gravitational collapse of a dustlike sphere 


A discussion of the course of change of the internal state of a collapsing body (including 
in the course of the process its compression below the Schwarzschild sphere) requires the 
solution of the Einstein equations for the gravitational field in a material medium. In the 
centrally symmetric case the field equations can be solved in general form if we neglect 
the pressure of the matter: p = 0(R. Tolman, 1934). Although the approximation made is 
not usually admissible in real situations, the general solution of this problem has consider- 
able methodological interest. 

As was shown in § 97, for a dustlike medium one can choose a reference system which ts 
both synchronous and comoving.t Denoting the time and the radial coordinate chosen 
in this way by t and R, we write the spherically symmetric line element in the form? 


ds? = di? - e GR? — r?(1,R)\(d0? + sin? 0 dd’). (103.1) 


The function r(z, R) is the "radius," defined so that 2zr is the circumference of a circle 
(with center at the origin). The form (103.1) fixes the choice of t uniquely, but still permits 
arbitrary transformations of the radial coordinate of the form R = R(R’). 

The calculation of the components of the Ricci tensor for this metric leads to the following 
system of Einstein equations:$ 


—e ^r? &2rp i241 = 0, (103.2) 
“rae C it 47 = 0. (103.3) 
r r 2 r 
en? I . 
-=r Qrr' +r? -mAe (rrA-- F3 +1) = 8nke, (103.4) 
2j —Àr' = 0, (103.5) 


where the prime denotes differentiation with respect to R, and the dot with respect to r. 


+ The matter must move "without rotation” (cf. the footnote on p. 288). In the present case this condition 
is surely satisfied, since spherical symmetry implies purely radial motion of the material. 

1 In this section, we set c — I. 

$ Compare problem 5 of § 100. Equations (103.2-5) are obtained, respectively, from eqs. (2)-(5) of the 
problem if we set v = 0, e" = r?, p — 0. We note that the second of eqs. (6) of this same problem gives 
v = 0, i.e, v = v(7), when p = 0; the remaining arbitrariness in the metric (1) in the choice of 7 therefore 
allows us to make v equal to zero, which again demonstrates the possibility of introducing a synchronous- 
comoving teference frame. 
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Equation (103.5) is immediately integrated over the time, giving 


"E 


e =. PUO (103.6) 


where f(R) is an arbitrary function, subject only to the condition that 1 +f>0. Substituting 
this expression in (103.2), we get 


2rř+ -Íf-0 
[substitution in (103.3) gives nothing new]. The first integral of this equation is 


i? = (r+. (103.7) 


where F(R) is another arbitrary function. Thus 


^ dr 


Toc + F 
Jit 


The function r(z, R) obtained from the integration can be written in the parametric form: 


F 
= rog (cosh n—1), t o(R)—t = Um (sinh 9 — 5) for f »0, (103.8) 
i E )  To(R) ( ) for f «0 103.9 
r= — (l —~cos T -t= — sin <0, . 
-3 nh To X ES n — sin n) for ( ) 


where t)(R) is again an arbitrary function. If f = 0, 


r -(F) [to(R)—t]* for f = 0. (103.10) 


In all cases, substituting (103.6) in (103.4) and eliminating f by using (103.7), we get the 
following expression for the matter density:t 


8nke = —. (103.11) 


Formulas (103.6~11) determine the required general solution. We remark that it depends 
on only two "physically different" arbitrary functions: although three functions, f, Fand to 
appear in it, the coordinate R can still be subjected to an arbitrary transformation R = R(R’). 
This number corresponds exactly to the fact that the most general centrally symmetric 
distribution of matter is given by two functions (the density distribution and the radial 
velocity of the matter), while a free gravitational field with central symmetry does not exist. 

Since the reference frame is comoving with the matter, to each particle of matter there 


+ The functions F, f, to have only to satisfy conditions assuring the positivity of e^, r and «. In addition 
to the condition 1—/- 0 given above, it follows that F»0. We shall also assume that F’>0, r'» 0; this 
excludes cases that lead to crossing of spherical layers of matter during their radial motion. 

* It does not, however, include the special case where r = r(7), and does not depend on R, so that eq. 
(103.5) reduces to an identity; cf. V. A. Ruban, JETP, 56, 1914 (1969); Soviet Phys. JETP, 29, 1027 (1969). 
This case does not, however, correspond to the conditions of the problem of collapse of a finite body. 
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corresponds a definite value of R; the function r(t, R) for this value of R determines the 
law of motion of the particular particle, while the derivative 7 is its radial velocity. An 
important property of the solution obtained here is that the assignment of the arbitrary 
functions appearing in it over the interval from 0 to some Ry completely determines the 
behaviour of the sphere of this radius; it does not depend on how the functions are assigned 
for R> Ry. One automatically obtains the solution of the interior problem for any finite 
sphere. The total mass of the sphere is given, according to (100.23), by the integral 


r(t, Ro) Ro 
m= 4n | er?dr = 4n ferrar. 
Substituting (103.11) and noting that F(0) = 0 (when R = 0, we must have r = 0), we find 


_ F(Ro) 
2k ' 


r, = F(Ro) (103.12) 


(r, is the gravitational radius of the sphere). 

For F = const #0, we find € = 0 from (103.11), so that the solution applies to empty 
space, i.e. it describes the field of a point mass (located at the center, the singular point 
of the metric). So, setting F = r, f = 0, ty = R, we obtain the metric (102.3).t 

Formulas (103.8~10) describe both contraction and expansion of the sphere (depending 
on the range of values taken by the parameter y); both are equally admissible for the field 
equations. The important problem of behaviour of an unstable massive body corresponds 
to contraction-—gravitational collapse. The solutions (103.8-10) are written so that contrac- 
tion occurs when r. while increasing, tends to tọ. To the moment t = To{R) there corres- 
ponds the arrival at the center of the matter with a given radial coordinate R (where we 
must have t0). 

The limiting character of the metric inside the sphere as t— t;(R) is the same for all 
three cases (103.810): 


9F\} 2FM ) 
s) (t,—03.. e^? ~(F) Fagen (103.13) 
N 


This means that all radial distances (in the comoving reference frame used here) tend to 
infinity, while tangential distances go to zero (like t— tọ).ł Correspondingly the matter 
density increases without limit:$ 


2F' 
x t o. 103.114 
8nke aren ( ) 


f The case of F = 0 (where (103.7) gives r = vV/f(s — 19) corresponds to the absence of a field; by a 
suitable transformation of variables, the metric can be brought to Galilean form. 

t The geometry on a "plane" passing through the center is that which would exist on a conical surface of 
revolution which is stretching in the course of time along its generators and at the same time contracting 
along its bounding circles. 

$ The fact that in this solution collapse occurs for any mass of the sphere is a natural consequence of 
neglecting the pressure. Clearly, as «—co, from the physical point of view the assumption that the matter 
is dustlike is never admissible, and we should use the ultrarelativistic equation of state p = «/3. It appears 
however, that the general character of the limiting laws of compression are to a large extent independent 
of the equation of state of the matter (cf. E. M. Lifshitz and-l. M. Khalatnikov, JETP 39, 149, 1960; Soviet 
Phys. JETP, 12, 108, 1961). 
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Thus. in agreement with our remarks in § 102, there is a collapse of the whole matter 
distribution in to the center. 

In the special case where the function tol R) = const (i.e. all the particles reach the center 
simultaneously), the metric inside the contracting sphere has a different character. In this 


case 
OF\' 2 5 2M F' 
ref |} (toT). es — —— (1, - 0 
3 3] 2FSN 0f 


4 


359—101) 


8nke x (103.15) 
ie. as t to. all distances, both tangential and radial. tend to zero according to the same 
law ~(to~1)*; the matter density goes to infinity like (x9 — 1) ? and. in the limit, its 
distribution becomes uniform. 

We call attention to the fact that in all cases the moment of passage of the surface of the 
collapsing sphere through the Schwarzschild sphere [r(r. Ro) = r,] has no significance 
for its internal dvnamics (described by the metric in the comoving reference frame). At 
each moment of time. however, a definite part of the sphere is already below its "event 
horizon". Just as F(R,), through (103.12), determines the gravitational radius of the 
sphere as a whole. so F(R) for any given value of R is the gravitational radius of the part 
of the sphere within the spherical surface R — const: thus this part of the sphere is deter- 
mined at each moment of time t by the condition r(t, R)< F(R). 

Finally, we show how these formulas can be used to solve the problem posed at the 
end of $ 102: to construct the most complete reference system for the field of a point mass.t 

To achieve this goal we must start from a metric in vacuum that could contain both 
contracting and expanding space-time regions. Equation (103.8) is such a solution. in 
which we must set F = const = r,. Also choosing 

l 


R -4 
S= appear cin 


r 1 /R? 
; 5 zt! (1 —cos n), 


9 


t 1/R*. M , 
- = >| z +1} (t-n+sin n); 


g 2r 


we get: 


(103.16) 


where the parameter y runs through values from 0 to 27. the time t (for a given R) decreases 
monotonically, while r increases from zero, goes through a maximum, and then again 
drops to zero. 

In Fig. 24, the lines ACB and A'C'B' correspond to the point r = 0 (parameter values 
n = 2n and y = 0). The curves AOA’ and BOB’ correspond to the Schwarzschild sphere 
r — r,. Between A'C'B' and A'OB' is the region of space-time in which only motion out 


t The case where 7ọ - const includes, in particular, the collapse of a completely homogeneous sphere— 
see the problem. 

t Such a system was first found by M. Kruskal using other variables (M. Kruskal. Phys. Rev. 119, 1743, 
1960). The form of the solution given here, in which the reference system is synchronous, is due to I. D. 
Novikov, 1963. 
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from the center is possible, while between ACB and AOB there is the region in which 
motion occurs only toward the center. 


Fic. 24. 


_ The world line of a particle that is at rest relative to this reference system is a vertical line 


(R = const). It starts from r = 0 (point a), cuts the Schwarzschild sphere at the point b, 
. 2 


reaches its farthest distance | r = r,| — + j at time t = O, after which the particle again 
r 


begins to fall in toward the Schwarzschild sphere, passes through it at point c, and arrives 
once more at r = 0 (point d) at the time 


n/R? d 
T= "o> itl . 
9 


This reference system is complete: both ends of the world line of any particle moving 
in the field are either at the true singularity r = 0, or go off to infinity. The incomplete metric 
(102.3) covers only the region to the right of the curve 404’ (or to the left of BOB’), while 
the "expanding" reference system covers the region to the right of BOB’ (or to the left 
of AOA’). The Schwarzschild reference frame with the metric (100.14) covers only the 
region to the right of BOA’ (or to the left of AOB’). 


PROBLEM 


Find the solution of the interior problem for the gravitational collapse of a dustlike homogeneous 
sphere whose material is at rest initially. 
Solution: Setting 
to = Const, f = —sin? R, F = 2a sin? R, 
we find 
r= @ sin R(1—cos y}, t—To = ao(1 —sin y) a) 


(the radial coordinate R is dimensionless and runs through values from 0 to 22). Then the density is 


8ake = ~- 6 


ae ( 
ai(1 —cos y)? 2) 
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and, for a given 7. ts independent of R, so the sphere is homogeneous. We can represent the metric 
(103.1) with r from (1) in the form 


ds? = dr? -a*(r)dR? ~sin? R(d0? —sin? 0 dd), (3) 
a = aol! -cos y). 


We call attention to the fact that it coincides with the Friedmann solution for the metric of a 
universe completely filled with uniform dustlike matter (3 112)—a co™>letely natural result, 
since a sphere cut out of a uniform distribution of matter has central symmetry.t 

The condition originally posed can be satisfied by (1) for definite choices of the constants ao, To. 
Making a change of parameter for convenience (1;—~7 ~1), we write the solution in the form 
ro Sin R 


Fo . 
: (1 —cos y), = —— (y ~s T 
2 sin Ro RS, D 2sin RU in n) (4) 


where [according to (103.12)] the gravitational radius of the sphere is r, = ro sin? Ro At the 
initial time (7 - 0, 1 = 0) the matter is at rest (5. = 0), and 2, = 237(0, Ro) is the initial circum- 
ference of the sphere. The collapse of this matter into the center occurs at the time t = zw, 2 sin Ro. 

The time 7 in the reference frame of a distant observer (the Schwarzschild frame) is related to the 
proper time 7 on the sphere by the equation 


x dr? 
dr? = (1 SIL d ; 
r Ól—ror 


where by + we mean the value r(r, Ro) corresponding to the surface of the sphere. Integration of 
this equation leads to the following expression for ¢ as a function of the same parameter 4: 


7 
cot Ro +tan 
2 | 
— In ——— ———— - cot Ro | ——— 
: n .. 2sin? Ro 
cot Ro -tan , 


ry 


O —sin | (5) 


(where the time ¢ = 0 corresponds to t = 0). The passage of the surface of the sphere through 
the Schwarzschild sphere (r(r, Ro) = r,) corresponds to the value of the parameter » determined 
by the equation 

H ] ry 


2 ro 


cos? = sin? Ro. 


As we approach this value, the time / goes to infinity, in accordance with the remarks in § 102.7 


$ 104. Gravitational collapse of nonspherical and rotating bodies 


All the results of the preceding two sections were applicable strictly to bodies that are 
rigorously spherically symmetric. Simple arguments show. however, that the qualitative 
picture of gravitational collapse remains the same for bodies with small deviations from 
spherical symmetry (A. G. Doroshkevich, Ya. B. Zel'dovich and I. D. Novikov, 1965). 

We shall consider, first, bodies whose deviation from central symmetry is related to the 
matter distribution and not to a rotation of the body as a whole. 

It is obvious that if a massive centrally symmetric body is gravitationally unstable, this 


t The metric (3) corresponds to a space of constant positive curvature. In analogous fashion, setting 
f = sinh? R, F = 2ao sinh? R, we obtain the solution corresponding to a space of constant negative curva- 
ture (3 113). 

+ The function r(7, Ro) determined from formula (4) coincides, of course, with the function calculated 
from the external metric and given by the integral (102.8). The same remark applies to the function ¢(r) 
defined by formulas (4) and (5); it coincides with that given by the integral (102.5). 
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instability will remain for small disturbances of the symmetry, so that such a body will 
collapse. Treating the weak asymmetry as a small perturbation, we can follow its develop- 
ment (in the comoving frame) during the contraction of the body. Perturbations, generally 
speaking, increase with increasing density of the body. But if the perturbations are suffi- 
ciently weak at the start of the contraction, they will still be small at the time when the body 
reaches its gravitational radius; it was pointed out in § 103 that this moment is in no way 
special for the internal dynamics of the contracting body, while its density is, of course, 
still finite.* 

Because the internal perturbations in the body are small, the perturbations of the external 
centrally symmetric gravitational field produced by it also remain small. This means that 
the surface of the "event horizon," the Schwarzschild sphere, also remains almost un- 
changed, and nothing prevents the collapsing body from crossing it (in the comoving 
reference frame). 

No information reaches the external observer concerning the further growth of per- 
turbations inside the body since no signals emerge from beyond the event horizon: the whole 
process remains “infinitely delayed" for the distant observer. From this it follows. in turn, 
that with respect to the external reference frame the gravitational field of the collapsing 
body must tend to become stationary as the body approaches the gravitational radius 
asymptotically. The characteristic time for this approach is very small (~r,/c). and during 
it we may assume that in the external space there are only those perturbations that deve- 
loped earlier in the centrally symmetric field. But all disturbances must in the course of time 
dissipate in space as gravitational waves, going out to infinity (or passing beyond the 
horizon). 

Time-independent static disturbances also cannot remain in the external gravitational 
field of a developing black hole. This conclusion can be obtained from an analysis of 
constant perturbations applied on the Schwarzschild sphere in vacuum. Such an analysis 
shows that in the static case every perturbation (which drops off at infinity) increases without 
limit as it approaches the Schwarzschild sphere of the unperturbed problem:1 but, as already 
stated, in the present case there is no reason for the appearance of large perturbations of 
the external field. 

The deviations from spherical symmetry in the density distribution of the body are 
described by quadrupole and higher multipole moments of the distribution; each of them 
gives its contribution to the external gravitational field. Our assertion means that all such 
perturbations of the external field damp out in the final stages (from the point of view of 
the external observer) of the collapse.§ The developing gravitational field of the black hole 
is again the centrally symmetric Schwarzschild field, determined solely by the total mass of 
the body. 


t The development of perturbations in a nonstationary, unbounded homogeneous distribution of matter 
is treated in 8 115 (where the formulas obtained apply equally to the cases of expansion and contraction). 
Nonuniformities of the unperturbed distribution or infinite extent of the body do not change the conclusions 
drawn there. 

1 Cf. T. Regge and J. A. Wheeler, Phys. Rev. 108, 1063 (1957). We emphasize that we are speaking of 
perturbations coming from the central body itself. The condition imposed at infinity excludes cases where 
static perturbations arise from external sources: in such cases small perturbations only distort the Schwarzs- 
child sphere without changing its qualitative properties and without producing a true space-time singularity 
on it. 

$ For this damping law, cf. R. H. Price, Phys. Rev. D, 5, 2419, 2439 (1972). During collapse, initially 
static 2'-pole perturbations of the external gravitational field damp like 1/7?! *?. 


8114  GRAVITATIONAL COLLAPSE OF NONSPHERICAL AND ROTATING BODIES 323 


The question of the ultimate fate of the body in its collapse beyond the event horizon 
(which is not observable from the external reference frame) is not completely clear. Appa- 
rently one can assert that here, too, the collapse terminates in a true singularity of the 
space-time metric, but a singularity of a completely different type than in the centrally 
symmetric case. This question is, however, not settled completely at present. 

Let us turn to the case where the weak disturbance of the spherical symmetry is associated 
not with the density distribution but with rotation of the body as a whole; the assumption 
that the perturbation is weak means that we must have a sufficiently slow rotation. All 
the previous remarks remain valid with one exception. It is clear from the start that, because 
of the conservation of the total angular momentum M of the body, the field of the black 
hole cannot depend on just the mass of the body. To this there corresponds the fact that 
among the stationary (but not static) time-independent perturbations of a centrally symmet- 
ric gravitational field there is one which does not increase without limit as r—7,. This 
perturbation is just the one that is associated with rotation of the body, and is described 
by adding to the Schwarzschild metric tensor g;, (in the coordinates x? = t, x! =r, x? = 6, 


x? = @) the small off-diagonal component:t 


2kM , , 
903 = “es sin’ 8 (104.1) 


(cf. the problem in § 105). This expression remains valid (in the external space) as the body 
- approaches the gravitational radius, and thus the gravitational field of a slowly rotating 
black hole will (to first approximation in the small angular momentum M) be a centrally 
symmetric Schwarzschild field with the small correction (104.1). This field is no longer 
static but only stationary. 

If gravitational collapse is possible for small disturbances of spherical symmetry, then 
collapse of the same character (with movement of the body beyond the event horizon) 
must also be possible in some finite range of sizeable deviations from sphericity; the condi- 
tions determining this region are not as yet established. Independent of these conditions, 
it appears that one can assert that, from the point of view of an external observer, the pro- 
perties of the structure resulting from the collapse (a rotating collapsar) are independent of 
all characteristics of the initial body except for its total mass m and angular momentum M.? 
If the body does not rotate as a whole (M = 0), then the external gravitational field of the 
collapsar is the centrally symmetric Schwarzschild field.$ : 

The gravitational field of the rotating black hole is given by the following axially sym- 
metric stationary Kerr metric: 


+ In this section, we set c = 1. 

+ To avoid misunderstandings we remind the reader that we are not considering bodies that carry an 
uncompensated electric charge. 

$ This assertion is strongly supported by the following theorem due to Israel: among all static solutions 
of the Einstein equations that are galilean at infinity and have closed single-sheeted spatial surfaces goo = 
const, f = const, the Schwarzschild solution is the only one that has a horizon (goo = 0) without a 
singularity of the space-time metric on it (for the proof cf. W. Israel, Phys. Rev. 164. 1776, 1967). 

J This solution of the Einstein equations was discovered by R. Kerr, 1963, in a different form, and reduced 
to (104.2) by R. H. Boyer and R. W. Lindquist, 1967. There is no constructive analytic derivation of the 
metric (104.2) in the literature that is adequate in its physical ideas, and even a direct check of this solution 
of the Einstein equations involves cumbersome calculations. The claim that the Kerr metric is the unique 
field for a rotating collapsar is supported by a theorem analogous to the theorem of Israel (cf. B. Carter, 
Phys. Rev. Lett. 26, 331, 1971). 
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ror - 
ds? = (- arn dr! — p? q^ — 


«(n opus ua DL 849^ 475 477 sin? dà dt, (104.2) 
p 


where we have introduced the notation 
A — r—rrea^, p? = r? c a? cos? 0, (104.3) 


while r, is again r, — 2mk. This metric depends on two constant parameters, m and a, 
whose meaning is clear from the limiting form of the metric at large distances r. To terms 
of order ~ l/r, we have: 


r rd. 
JooX1l—~, gos sin? 6; 
r r 


comparison of the first expression with (100.18) and of the second with (104.1) shows that 
m is the mass of the body, while the parameter a is related to the angular momentum M by 


M = ma (104.4) 


(M = mac in the usual units). For a = Othe Kerr metric goes over into the Schwarzschild 
metric in its standard form (100.14).t* We also call attention to the fact that the form (104.2) 
exhibits explicitly the symmetry under time reversal: this transformation (r— —1) also 
changes the rotation direction, i.e., the sign of the angular momentum (a— — a), so that 
ds? remains unchanged. 

The determinant of the metric tensor (104.2) is 


—g = p* sin? 0. (104.5) 


We also give the contravariant components g^, by introducing them into the following. 
expression for the square of the four-gradient operator: 


, 0 0 1 r,ra? é\? Ase? 
ik 7 7 Mg 2 2 58-5 0 es va — 
9 ax! dx de HU x p? pn: Xs) p (5) 


ifo 1 1 QV ana 0 0 104.6 
p^ N00] Asin? @\ — p? ad DA (104:6) 


When m = 0, in the absence of a gravitating mass, the metric (104.2) should besote 
galilean. In fact, the expression 


2 
ds? = dpe -> dr? — pèd? ~(r? +a )sin? 8 dg? (104.7) 


is the galilean metric 
2. 2 TAM 2... 2 
ds" = dt —dx —dyi-dz 


written in spatially oblate spheroidal coordinates; the transformation to cartesian coordi- 


t To terms of first order in a, the metric (104.2) for a& 1 differs from the Schwarzschild metric only in the 
term (2rga/r) sin? @d¢ dt, in agreement with our remarks above about the case of weak deviations from 
spherical symmetry. 
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nates is accomplished with the formulas: 


IEEE 
x = vr’+a’ sin Ó cos ¢, 


y = Vr? xa? sin @ sin $, 


z = rcosÓ; 
the surfaces r = const are oblate ellipsoids of rotation: 
CE Ei 
rua gr o 
The metric (104.2) has a fictitious singularity, just as the Schwarzschild metric (100.14) 
has a fictitious singularity at r — r,. But, whereas in the Schwarzschild case goo goes to 
0 and g,, to infinity simultaneously on the surface r = r,, in the Kerr metric these two 


surfaces are separated. The equality ggg = 0 holds when p? = rr,; the larger of the two 
roots of this quadratic equation is 


. 2 
ro = NU — a? cos? 0 (goo = 9). (104.8) 


The quantity g,, goes to infinity when A = 0; the larger of the two roots of this equation is 


r Ant EN 
Thor = aa (2) —a’ (gi; = ©). (104.9) 


For brevity, we shall denote the surfaces r = rg and r = r,,,, whose physical meaning is 
explained below, by S, and Shor- The surface Shor is a sphere, while Sọ is an oblate figure 
of rotation; Shor is contained inside Sg, and the two surfaces touch at the poles (8 = 0 
and 0 = z). 

As we see from (104.8-9), the surfaces So and Shor exist only when a<r,/2. When a>r,/2 
the character of the metric (104.2) changes radically and begins to show physically inadmis- 
sible properties that violate the principle of causality. 

The fact that the Kerr metric is no longer meaningful for a>r,/2 implies that the value 


ie dnd Np ce (104.10) 


gives the upper limit of possible angular momenta of a collapsar. It must be regarded as a 
limiting value that can be approached arbitrarily closely, while exact equality a = a, is 
impossible. The corresponding limiting values of the radii of the surfaces Sy and Spo, are 


ro = (0 +sin 8), Troe = É (104.11) 


We shall show that the surface S,, is the event horizon, which is passed by a moving 
particle or light ray in only one direction, toward the interior. 


t The violations manifest themselves in the appearance of closed timelike world lines: they would make it 
possible to head into the past and then later develop into the future. We point out immediately that these 
same violations would appear if we extended the Kerr metric inside of Spor, even when a < r,/2, which shows 
the physical inapplicability of this metric inside of Saor (we shall return to this point later). For this same 
reason, there is no physical interest in the surfaces defined by the two smaller roots of the quadratic equations 
goo = Q and 1/g,; = 0, which lie inside Spor; cf. B. Carter, Phys. Rev. 174, 1559 (1968). 
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As a preliminary, we show that, from the general point of view, the property of unidirec- 
tional passage of world lines of moving particles holds for any null hypersurface (i.e. a 
hypersurface whose normal at every point is a null vector). Suppose the hypersurface is 
defined by the equation f(x°, x', x?, x?) = const. Its normal is directed along the four- 
gradient n! = df/éx', so that for a null hypersurface we have n;n' = 0. This means, in other 
words, that the direction of the normal lies in the surface itself: along the hypersurface 
df = n,dx' = 0, and this equation is satisfied when the directions of the four-vectors 
dx! and n' coincide. From this same property rt;n' = 0, the element of length on the hyper- 
surface in this same direction is ds = 0. In other words, along this direction the hyper- 
surface is tangent, at the given point, to the light cone constructed on the point. Thus, 
the light cones constructed at each point of a null hypersurface (say, in the direction of 
the future) lie entirely on one side of it, and are tangent to the hypersurface (at these points) 
along one of their generators. But this means precisely that (directed into the future) the 
world lines of particles or light rays can cross the hypersurface in only one direction. 

This property of null hypersurfaces is usually physically trivial: unidirectional passage 
through these surfaces simply expresses the impossibility of motion with a velocity greater 
than the light velocity (the simplest example of this sort is the hypersurface x = z in a flat 
space-time). A nontrivial new physical situation arises when the null hypersurface does not 
extend out to spatial infinity, so that its sections r = const are closed spatial surfaces; 
these surfaces are the event horizon in the same sense as the Schwarzschild sphere was in 
the centrally symmetric gravitational field. 

What is the surface Spo, in the Kerr field? The condition n;n? = 0 for the hypersurface 
of the form f(r, 0) = const in the Kerr field has the form 


wily a (0f V l efN f(ofN? 
r(e- S22] 0 enn 
[with g* from (104.6)]. This equation is not satisfied on So, but is satisfied on Spo, (where 
0f[00 = 0, A = 0). 

Continuation of the Kerr metric inside the surface of the horizon (as was done in §§ 102, 
103 for the Schwarzschild metric) has no physical meaning. Such a continuation would 
depend on only the same two parameters (m and a) as the field outside of S,,,, from which 
it is already clear that it could have no connection with the physical question of the fate of 
the collapsing body after its passing under the horizon. The effects of nonsphericity are not 
at all damped out in the comoving reference frame, and, on the contrary, must increase 
with further contraction of the body, so there is no reason to expect that the field beyond 
the horizon should be determined solely by the mass and angular momentum of the body.t 

Let us consider the properties of the surface Sy and the space between it and the horizon 
(this region of the Kerr field is called the ergosphere). 

The fundamental property of the ergosphere is that no particle in it can remain at rest 
relative to the frame of a distant observer: when r, 0, @ = const, we have ds? «0, i.e. the 
interval is not timelike, as it should be for the world line of a particle; the variable ¢ loses 
its temporal character. Thus a rigid reference frame cannot be extended from infinity into 
the ergosphere, and in this sense the surface Sy might be called the limit of stationarity. 

The character of the motion of a particle in the ergosphere is essentially different from 


+ Mathematically, this situation manifests itself in the violation of the causality principle (mentioned 
above) when we extend the Kerr metric into the interior of Swor- 
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what we had inside the horizon of the Schwarzschild: field. In the latter case, also, particles 
could not be at rest relative to an external reference system, and we could not have r = const; 
all particles had to move radially toward the center. In the ergosphere of the Kerr field 
it is @ = const that is impossible for particles (the particles must necessarily rotate around 
the axis of symmetry of the field), while r — const is possible for a particle. Furthermore, 
particles (and light rays) can move both with increasing and decreasing r, and can emerge 
from the ergosphere into the external space. Corresponding to this last point, it is possible 
for particles coming from the external region to reach the ergosphere: the time for such 
a particle (or light ray) to reach the surface Sọ, as measured with the clocks ¢ of a distant 
observer, is finite for all of So except for :he poles. at which Sọ touches Sha; the time for 
reaching these points, just as for all points of Spa. is again. of course, infinite.t 

Because of the inevitability of rotational motion of particles in the ergosphere, the 
natural form for writing the metric in this region is: 


2 2 
ds? = (s. - 8) di?  g, dr? 4 g,,4d0? +933 C) : (104.13) 
953 933 
The coefficient of dr’, ; 
gos A 
Joo = 


E a Rene ES: SEU RENE AES 
gs, rta c ryra? sin? 0 p? 
is positive everywhere outside of Spor (and does not vanish on So); the interval ds is timelike 
when r = const, 6 = const, dọ = —(go3/g33)dt. The quantity 


Go3 rar 


933 pria?) ryra? sin? 0 


(104.14) 


plays the role of a "generalized angular velocity of rotation of the ergosphere" relative to 
the external reference system (where this direction of rotation coincides with the direction 
of rotation of the central body). 

The energy of a particle, defined as —CS/¢t, the derivative of the action with respect 
to the proper time of the particle, synchronized along the trajectory, is always positive 
(cf. § 88). But, as explained in § 88, during motion of the particle in a field independent 
of time z, the energy éo defined as —CS/ér, is conserved; this quantity coincides with the 
covariant component of four-momentum p, = mug = mgo;dx' (where m is the mass of the 
particle). The fact that the variable z (the time according to the clocks of the distant observer) 
does not have temporal character inside the ergosphere results in a peculiar situation: 
in this region goo <Q, and thus the quantity 


dt do 
Eo = M(Joou? + go?) = m | goo — 903 —— 
ds ds. 


1 The time for reaching particular points of So may also turn out to be infinite in particular cases of special 
values of the energy and angular momentum of the particle, chosen so that the radial velocity vanishes at 
the particular point of So. 

t1 We call attention to the fact that intervals of proper time for particles moving along the boundary 
of the ergosphere do not vanish along with goo. In this sense, So is not the surface of “infinite red shift”; 
the frequencies of light signals sent out from it by a moving source (here sources cannot be at rest) and obser- 
ved by a distant observer, do not go to zero. We recall that in the centrally symmetric field there could be 
neither sources at rest nor moving sources on the Schwarzschild sphere (since a null surface cannot contain 
timelike world lines). The "infinite red shift" arose there because, as r—r,, intervals of proper time dt = 4/goo 
dt (for given dt), as measured with clocks at rest relative to the reference frame, went to zero. 
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can be negative. Since, in the external space, where £ is the time, the energy ĉo cannot be 
negative, a particle with £o <0 cannot fall into the ergosphere from outside. A possible 
source for the appearance of such a particle is the breakup of a body, entering the ergo- 
sphere, into, say, two parts, one of which is captured in a "negative energy" orbit. This 
part can no longer emerge from the ergosphere, and is finally captured within the horizon. 
The second part can return into the external space; since ĉo is a conserved additive quantity. 
the energy of this part is greater than the energy of the initial body, so we get an extraction 
of energy from the rotating black hole (R. Penrose, 1969). 

Finally, we note that although the surface S, is not singular for the space-time metric, 
the purely spatial metric [in the reference frame (104.2)] does have a singularity there. 
Outside of So, where the variable 1 has temporal character, the spatial metric tensor is 
calculated from (84.7), and the element of spatial distance has the form 

2 in 
d =- dh pdo? + Asm o dọ’. (104.15) 
A |—-rrjp 


Near S, parallels (0 = const, r = const) go to infinity according to the law 2za sin? 0/ ./Goo- 
The difference in readings of clocks [cf. (88.5)] also goes to infinity here when they are 
synchronized along this closed contour. 


PROBLEMS 


1. Carry out the separation of variables in the Hamilton-Jacobi equation for a particle moving 
in the Kerr field (B. Carter, 1968). 
Solution: In the Hamilton-Jacobi equation 
, eS 6S 
“asa = 0 
Ox' x 


(m is the mass of the particle, not to be confused with the mass of the central body) with g“ from 
(104.6) the time ¢ and the angle ¢ are cyclic variables; they therefore enter in the action S in the 
form —@ot+Ld, where é, is the conserved energy and L denotes the component of the angular 
momentum along the axis of symmetry of the field. It turns out that the variables 0 and + can also 
be separated. Writing S in the form 


S =- -6t +L - Str) - Salh), (1) 


we reduce the Hamilton-Jacobi equation to two ordinary differential equations (cf. Mechanics, 


$48): 
2 
eJ + (aé, sin 0 L ) +a?m? cos? 6 = K, 
sin 0 


e D) ~{U? +a2)6o -aLy ~m’? = -K, (2) 


where K (the separation parameter) is a new arbitrary constant. The functions Sg and S, are then 
determined by simple quadratures. 
The four-momentum of the particle is 


dx a a €S 
=m- =g = ge, 
P ds P. Ox, 


Calculating the right-hand side of this equation using (1) and (2), we get the following equations: 
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mo = A L+ (a +a? ues sin? e). (3) 
d$ L FIN rra 
e ds — Xsin?8 (i X) Fo, (4) 
dr\? l A 
m? (T) Siz [(r? +a?)fo —aL} E (K +m?r?), (5) 
do? l l LN 
i( — PES —g?m? 28)-. —( ad, sin8— —— Y. 
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These integrals are the first integrals of the equations of motion (the equations of the geodesics). 
The equation of the trajectory and the time dependence of the coordinates along the trajectory 
can be found either from (3) to (6) or directly from the equations 


QS:0€, = const, OSL -- const, CS/OK = const. 


For the case of light rays, we must set m = 0 on the right sides of equations (3)-(6) and write 
Wo in place of ĉo (cf. 3 101), while we must replace the derivatives md/ds on the left sides by the 
derivatives d.d? with respect to the parameter 7, which varies along the ray (cf. the end of § 87). 

Equations (4)-(6) permit purely radial motion only along the axis of rotation of the body, as 
is already clear from symmetry arguments. From these same considerations it is clear that motion 
in a “plane” is possible only if the plane is equatorial. In that case, setting 0. = 2/2 and expressing 
K in terms of ŝo and L from the condition d6:ds = 0, we obtain the equations of motion in the 


form 
d orga, of, , rd? 
db M r ra 
g 9 & { 
ds A l a) 1 8) 
drN? l A 
m? n) = A [(r? +a?) aL]? — 5 (ads —L)? -m?r?]. (9) 
ds r r 


2. Determine the radius of the circle, closest to the center, that is a stable orbit for a particle 
moving in the equatorial plane of the limiting (a—r,/2) Kerr field (R. Ruffini and J. A. Wheeler, 
1969). 

Solution: Proceeding as in problem | of $ 102, we introduce the "effective potential energy" 
U(r), defined from 


[C? +a?) U(r) -aLY — A[(aU(r) —L)? +r?2m?] = 0 


[for fo = U the right side of eq. (9) vanishes]. The radii of stable orbits are determined by the 
minima of the function Ut), i.e. by simultaneous solution of the equations U(r) = ĉo, U(r) = 0 
for U’(r)>0. The orbit closest to the center corresponds to U"(r,,,) = 0; fOr r < rmm the function 
U(r) has no minima. As a result we obtain the following values for the parameters of the motion: 
(a) When L « 0 (motion opposite to the direction of rotation of the collapsar) 
Fan 9 0 5 L li 


no m^ avy m, 3v5 
(b) For L>0 (motion in the direction of rotation of the collapsar) as a—r,/2 the radius r,,,, tends 
toward the radius of the horizon. Setting a = (r,.2) (1 +ò), we find, for 0—0: 


Cer HL +26), ^m^ — Hi +(40)4), 


ry r 


y 


Then 


l 
m = mr, = V3 [1 +(46)4). 
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We call attention to the fact that min “no, remains greater than 1 throughout, i.e. the orbit does 
not go outside the horizon. This is as it should be: the horizon is a null hypersurface, and no time- 
like world lines of moving particles can lie on it. 


$ 105. Gravitational fields at large distances from bodies 


Let us consider a stationary gravitational field at large distances from the body producing 
it, and determine the first terms in its expansion in powers of I/r. 

Far from the body, the field is weak. This means that there the space-time metric is almost 
galilean, i.e. we can choose a reference system in which the components of the metric tensor 
are almost equal to their galilean values: 


goo = l. d$ =O gi = Onn (105.1) 
We accordingly write the g;, in the form 
Gin = di hae (105.2) 


where the Aj, are small corrections determined by the gravitational field. 

In operating with the tensor /,, we shall agree to raise and lower its indices using the 
*unperturbed" metric: At = g'h, etc. Here we must distinguish the /i* from the correc- 
tions to the contravariant components of the metric tensor g^. The latter are determined 
by solving the equations: 

gug” = (gi + hag” = ði, 
so that, to terms of second order, we find: 
g* = g 9 — nh + hih". (105.3) 
To this same accuracy, the determinant of the metric tensor is 
g = gQ rhe? —ihb, (105.4) 


where h=hi. 

We emphasize immediately that the condition that the A, be small in no way fixes a 
unique choice of reference frame. If this condition is satisfied in any one system, it will 
also be satisfied after any transformation x! = x'+é', where the č! are small quantities. 
According to (94.3). the tensor /,, then goes over into 

ee (105.5) 
CX* Cx 
where č! = gi’¢* [because of the constancy of the gí? the covariant derivatives in (94.3) 
reduce to ordinary derivatives in the present case]. 

In first approximation, to terms of order l;r, the small corrections to the galilean values 
are given by the corresponding terms in the expansion of the centrally symmetric Schwarzs- 
child metric. Because of the indeterminacy in the choice of reference frame (galilean at 
infinity) mentioned above, the specific form of the //;, depends on the way the radial coordi- 


t For a stationary field it is natural to admit only those transformations that preserve the time-indepen- 
dence of the g,,, i.e. the £ must be functions only of the space coordinates. 
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nate r is defined. Thus. if the Schwarzschild metric is written in the form (100.14) the first 
terms in its expansion for large r are given by the expression (100.18). Changing from 
spherical spatial coordinates to cartesian (for which we must write dr = n,dx*, where n 
is the unit vector in the direction of r), we obtain the following values: 
hoy = N^ "TE hi = 0, 105.6 
1 "E 118 i Dyllg, (105.6) 
where r, = 2km/c?.+ 
Among the terms of second order proportional to l/r?, there are terms having two 
different origins. Some of the terms result from the effect of nonlinearity of the Einstein 
equations on the first-order terms. Since the latter depend only on the mass (and on no 
other characteristics of the body). these second-order terms can depend only on the mass. 
lt is therefore clear that these terms can be obtained by expanding the Schwarzschild 
metric. In these coordinates. we find 


2 
hy -0. AG) = 6 nh. (105.7) 


The remaining second-order terms arise as the corresponding solutions of the linearized 
field equations. Having in mind later applications, we shall carry out the linearization of the 
equations with the formulas written in more general form than is needed here; at the start 
we shall not make use of the stationarity of the field. 

For small A; the quantities L';,, expressed in terms of derivatives of the Aj, are also 
small. Neglecting powers higher than the first, we can keep in the curvature tensor (92.1) 
only the terms in the first bracket: 

2 2 2 2 
ae pu ILC Ch, Oh, ) (105.8) 


Ox*6x! Exiax™ ôxiôx! Ox üx" 
For the Ricci tensor, we have to this same accuracy: 


Rig - g' MR iim © go Rites 


l e Bei ant, — ah 
Ra = 3(- go ——* xoa) (105.9) 


Or 


2 exlüx" T ada Ox! MUSS ~ Oxiüx* 


The expression (105.9) can be simplified by making use of the remaining arbitrariness 
in the choice of reference frame. We can impose on the /;, four (the number of arbitrary 
functions ¢') supplementary conditions 

ôy! 


ab =O WE = hi Bh. (105.10) 


+ If, however, we start from the Schwarzschild metric in isotropic spatial coordinates (cf. problem 4 
of § 100), we would get: 


AD == 0. (105.6a) 


r 
DOREM kh = —2 ô 
o0 r af r af’ 


The shift from (105.6) to (105.6a) is accomplished by the transformation (105.5) with 


rx 
2r 


332 THE FIELD OF GRAVITATING BODIES § 105 


Then the last three terms in (105.9) cancel one another, and we are left with 
Cth. 
Ry = —ig"? ——. (105.11) 


In the stationary case that we are considering here, when the A; do not depend on the 
time, the expression (105.11) reduces to Rj, = 4Ah;,,, where A is the Laplace operator in 
the three spatial coordinates. The Einstein equations for the field in vacuum thus reduce 
to the Laplace equation 


Ah, = 9, (105.12) 
with the supplementary conditions (105.10), which take the form 
G d 
T (hf --4hd!) = 0, (105.13) 
ô e 
ax ho = 0. (105.14) 


We call attention to the fact that these conditions still do not completely fix a unique choice 
of reference frame. It is easy to see that if the A,, satisfy eqs. (105.13-14), then the same 
conditions will be satisfied by the Aj, of (105.5) so long as the £' satisfy the equations 


Ač = 0. (105.15) 


The component hoo must be given by a scalar solution of the three-dimensional Laplace 
equation. We know that such a solution, proportional to l/r?, has the form a-V(I/r), 
where a is a constant vector. But a term of this type in Ag, can always be eliminated by 
simply shifting the coordinate origin in the term of first order in l/r. Thus the presence of 
such a term would indicate only a poor choice of the coordinate origin, and is therefore 
not of interest. 

The components ho, are given by a vector solution of the Laplace equation, i.e. they 
must have the form 3d 


hoa = Ras —3-— 
0a UT ayh y 


where 4,5 is a constant tensor. The condition (105.14) gives 


9? l 


^ aaax r 


from which it follows that the 4,, must have the form a,,+A6,,, where a,, is an anti- 


^ 


; ; e] ct 
symmetric tensor. But a solution of the form 4 =~; - can be eliminated by the transforma- 
ex r 


tion (105.5) with £? = 4/r, &* = 0 [satisfying the condition (105.15)]. Thus the only solution 
that has real meaning is 
h ô l 
= Aag Zg 
pe P xl, 
Finally, by similar but more complicated arguments one can show that by a suitable 
transformation of the spatial coordinates one can always eliminate the quantities h,, given 
by a tensor solution (symmetric in x and £) of the Laplace equation. 
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As for the tensor a,,. it is related to the total angular momentum tensor My, and the 
final expression for Ap, has the form 


"NES. jt. Be Wh 
V =G Map y. cu Meg (105.16) 


We show this by calculating the integral (96.17). 
The angular momentum M,, is related only to the hoa and so in calculating it we may 
assume that all other components of hix are absent. To terms of second order in the hoa, 


we have from (96.2-3) (we note that g^^ = —/"? = hyo, while —g differs from unity only 
by a term of second order: 
c^ € 4 g 
208 _ a0 „8y 70 28) _. 
Eug ge EU go eode — 90,4). 


l6nk ex? ^ 


Upon substituting from (105.16), the second term under the derivative sign vanishes, 
while the first gives 
"EC 9) | c 3ngn, ~ Spy 


Bn "Ox'Ox'r Br 7 r? 


Using this expression we find that, carrying out the integration in (96.16) over the surface 
of a sphere of radius r (df, = n,r7do): 


1 
E [oewer = 
c 
1 5 ` 
i =F [merus aate = —Y0,,M ,, —05,M,,) = $M a. 


An analogous calculation gives: 


Id m c? . 
L Jta - icai] df tt = LM. 
Adding these two quantities, we obtain the required value of Mg. 

We emphasize that in the general case, when the field near the body may not be weak, 
M, is the total angular momentum of the body together with its gravitational field. Only 
if the field is weak at all distances can its contribution to the angular momentum be 
neglected. t 

Formulas (105.6-7) and (105.16) solve our problem to terms of order 1/r?.? The covariant 
components of the metric tensor are: 


ga = gi hi + his. (105.17) 
According to (105.3), to this same accuracy the contravariant components are 
g* 2 GUI eth hD khi, (105.18) 


^ If the rotating body is spherical in shape, the direction of M remains the only distinguished direction 
fo: the field in all the space outside the body. If the field is weak everywhere (and not just at large distances 
irom the body) formula (105.16) is valid over all the space outside the body. This formula remains valid 
over ail space also for the case when the centrally symmetric part of the field is not weak everywhere, but the 
spherical body rotates sufficiently slowly (cf. problem 1). 

+ The transformations (105.5) with £9 = 0, £* = £* (x!, x?, x?) do not change the hoa. Thus the expression 
(105.16) does not depend on the choice of the coordinate r. 
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Formula (105.16) can be rewritten in vector form as t 


2k 
g= = nx M. (105.19) 
Gp 


where M is the total angular momentum vector of the body. It was shown in problem 1 of 
§ 88 that in a stationary gravitational field a “Coriolis force" acts on the body which is the 
same as that which would act on the body in a reference frame rotating with angular 
velocity 


A 


= > \ Goo Curl g 


to 


We may therefore say that in the field of a rotating body a coriolis force acts on a distant 
particle with strength corresponding to an angular velocity: 


k 
Q s Scu g = 5 [M-3n(M-n)}. (105.20) 
CF 
Finally, we apply the expression (105.6) to calculate the total energy of a gravitating 


body using the integral (96.16). Computing the necessary components of A^ from formulas 
(96.2-3), we find to the required accuracy (we keep terms to order 1/r?): 


h = 0, 
4297 = c (g° Og?) = me? d E I9* Xu = me? " 
l6nk 2 8x Ox? rp 4n r? 


Now, integrating in (96.16) over a sphere of radius r, we get, finally, 
=0, P? = mc, (105.21) 


a result which was naturally to be expected. It is an expression of the equality of "gravita- 
tional" and “‘inertial’’ mass ("gravitational mass is the mass that determines the gravita- 
tional field produced by the body, the same mass that appears in the metric tensor in a 
gravitational field, or, in particular, in Newton's law; "inertial" mass is the mass that 
determines the ratio of energy and momentum of the body; in particular, the rest energy 
of the body is equal to this mass multiplied by c?). 

In the case of a constant gravitational field it is possible to derive a simple expression 
for the total energy of matter plus field in the form of an integral only over the space 
occupied by the matter. We can do this, for example, by starting from the following expres- 
sion, which is valid when all quantities are independent of x?:1 

t To the assumed accuracy, the vector 9, = —Goz/9oo ¥ — o4. For this same reason, in the definitions of 


vector product and curl (cf. the footnote on p. 253) we must set y = 1, so that they may be understood in 
their usual sense for cartesian vectors. 


+ From (92.7) we have 
R? = g% Rio = g? (GS -T!o PRT e.) 


and, using (86.5) and (86.8), we find that this expression can be written as 


1 c i 
Queue d — r m : 
R$ Y =o ox zu —g g” Tio) — g I9; To) 
using the same relation (86.8) it is easy to show that the second term on the right is identically ev: to 


- M. rr "x and, because all quantities are independent of x. is equal to zero. Finally. replacing the sum- 


mation over / by one over a in the first term, for this same reason, we get (105.22). 
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o 168 
Ro = WETE = (V ~g 9'° T$). (105.22) 


Integrating R$ / =g over (three-dimensional) space and using the three-dimensional 
Gauss formula, we obtain: 


Jes Sg wag SIGTE df. 


Taking a sufficiently remote surface for the integration and using the expressions (105.6) 
for the g,;, we find, after simple calculations: 


- 4nk 4nk 
[rsa dv= mo po 
c 


c? 
Noting also that. according to the field equations. 


8xk Ank 2 
Ro = <a (To~4T) =a (To-T1-T3-T)). 


we get the required formula: 


1 -—- 
P9 = mc = = [et ri-r2- T3 — g dV. (105.23) 


C 


This formula expresses the total energy of the matter and the constant gravitational field 
(i.e., the total mass of the body) in terms of the energy-momentum tensor of the matter 
alone (R. Tolman, 1930). We recall that in the case of central symmetry of the field we 
had still another expression for this quantity—formula (100.23). 


PROBLEMS 


1. Show that formula (105.16) remains valid for the field over all space outside of the rotating 
spherical body under conditions of slow rotation (M-Zcmr,) but without the requirement that the 
centrally symmetric part of the field be small (A. G. Doroshkevich, Ya. B. Zel'dovich and I. D. 
Novikov, 1965; V. Gurovich, 1965). 

Solution: 1n spherical spatial coordinates (x! = +, x? — 0, x? = d) formula (105.16) is written 
as 

hos = 2kM sin? 6. (1) 
re 
Considering this quantity to be a small correction to the Schwarzschild metric (100.14), we must 
verify that the equation Ros = 0, linearized in /to;, is satisfied (since in the other field equations 
the correction terms drop out identically). Ros can be calculated using formula (4) of the problem 
in § 95, where the linearization means that the three-dimensional tensor operations should be carried 
out with the “unperturbed” metric (100.15). As a result we get the following equation 


G- Cos nn sin ô ( l Ohos -0 
ayuu LEN ans o) =O 


which expression (1) actually does satisfy. 

2. Determine the systematic ("secular") shift of the orbit of a particle moving in the field of a 
central body, associated with the rotation of the latter (J. Lense, H. Thirring, 1918). 

Solution: Because all the relativistic effects are small, they superpose linearly with one another, 
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so in calculating the effects resulting from the rotation of the central body we can neglect the in- 
fluence of the non-Newtonian centrally symmetric force field which we considered in § 101; in other 
words, we can make the computations assuming that of all the hx only the Aoa are different from 
zero. 

The orientation of the classical orbit of the particle is determined by two conserved quantities: 
the angular momentum of the particle, M =r xp, and the vector 
kmm'r, 

r 


A=? xM- 
m 


whose conservation is peculiar to the Newtonian field e = —Akm’/r (where m' is the mass of the 
central body). T The vector M is perpendicular to the plane of the orbit, while the vector A is directed 
along the major axis of the ellipse toward the perihelion (and is equal in magnitude to kmnr'e, 
where e is the eccentricity of the orbit). The required secular shift of the orbit can be described in 
terms of the change in direction of these vectors. 


The Lagrangian for a particle moving in the field (105.19) is 
k 
L=-m cF e Lot dt, ôL — mcg: vM UXE (1) 


(where we denote the angular momentum of the central body by M’ to distinguish it from the angular 
momentum M of the particle). Then the Hamiltonian is [cf. Mechanics (40.7)]: 


H = H+ 5H, xu Me Xp 


Computing the derivative M —ixp-rxp using the Hamilton equations è= êX /ĉp, 
p —(e»f[?r), we get: 2k 
M = -~; M xM. (2) 


Since we are interested in the secular variation of M, we should average this expression over the 
period of rotation of the particle. The averaging is conveniently done using the parametric represen- 
tation of the dependence of r on the time for motion in an elliptical orbit, in the form 


r — a(1—e cos &), t -I (£—e sin &) 
2n 


(a and e are the semimajor axis and eccentricity of the ellipse; cf. Mechanics, § 15): 


732 ZH 1 
i 7 r? J Tecos €)? aXI1—em 
o 


Thus the secular change of M is given by the > formula 

dM 2kM’ xM 

"dr 7 Fax eie e 
i.e. the vector M rotates around the axis of rotation of the central body, remaining fixed in mag- 


nitude. 
An analogous calculation for p^ vector À gives: 


À- i AMX A+ - ok vo (M: M’)(r x M). 


The averaging of this expression is carried out in the same way as before; from symmetry considera- 
tions it is clear beforehand that the averaged vector r/r? will be along the major axis of the ellipse, 
i.e. along the direction of the vector A. The computation leads to the following expression for the 
secular change of the vector À: 


dA 2kM' 


— -0xA, Q= AES jean '—3n(n : n?) (4) 


1 See Mechanics, § 15. 
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(n and n’ are unit vectors along the directions of M and M’), i.e. the vector A rotates with angular 
velocity Q, remaining fixed in magnitude; this last point shows that the eccentricity of the orbit 
does not undergo any secular change. 

Formula (3) can be written in the form 


with the same Q as in (4); in other words, Q is the angular velocity of rotation of the ellipse ‘‘as 
a whole". This rotation includes both the additional (compared to that considered in § 101) shift 
of the perihelion of the orbit, and the secular rotation of its plane about the direction of the axis of 
the body (where the latter effect is absent if the plane of the orbit coincides with the equatorial 
plane of the body). 
For comparison we note that to the effect considered in § 101 there corresponds 
|. 6nkm' 
~ gaü-er" 


§ 106. The equations of motion of a system of bodies in the second approximation 


As we shall see later ($ 110), a system of moving bodies radiates gravitational waves 
and thus loses energy. This loss appears only in the fifth approximation in l/c. In the first 
four approximations, the energy of the system remains constant. From this it follows 
that a system of gravitating bodies can be described by a Lagrangian correctly to terms 
of order l/c* in the absence of an electromagnetic field, for which a Lagrangian exists in 
general only to terms of second order (8 65). Here we shall give the derivation of the Lagran- 
gian of a system of bodies to terms of second order. We thus find the equations of motion 
of the system in the next approximation after the Newtonian. 

We shall neglect the dimensions and internal structure of the bodies, regarding them as 
*"pointlike"; in other words, we shall restrict ourselves to the zero'th approximation in the 
expansion in powers of the ratios of the dimensions a of the bodies to their mutual separa- 
tions /. 

To solve our problem we must start with the determination, in this same approximation, 
of the weak gravitational field produced by the bodies at distances large compared to their 
dimensions, but at the same time small compared to the wavelength 4 of the gravitational 
waves radiated by the system (a «& r « à ~ lejo). 

To terms of order I/c? the field far from the body is given by the expressions obtained 
in the preceding section and denoted there by A(U; here we use these expressions in the 
form (105.6a). In $ 105 it was tacitly assumed that the field was produced by just one body, 
located at the coordinate origin. But since the field Af] is a solution of the linearized Einstein 
equations, the principle of superposition holds for it. Thus the field far from a system of 
bodies is obtained by simply summing the fields of each of them; we write the field in the 
form 


hf = -569. (106.1) 


2 
Wg=0, hp. (00:2) 
c 
where 
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is the Newtonian gravitational potential of a system of point objects (r, is the radius vector 
to the body with mass m,). The expression for the line element with the metric tensor 
(106. 1-2) is: 


ds? = (14 E 4) c? dt? — (1- i 4) (dx? + dy? 4- dz?). (106.3) 


We note that first order terms containing $ appear not only in goo but also in g,,; in 
§ 87 it was already stated that, in the equations of motion of the particle, the correction 
terms in g, give quantities of higher order than the terms coming from goo; as a consequence, 
of this, by a comparison with the Newtonian equations of motion we can determine only 
goo: 

As will be seen from the sequel, to obtain the required equations of motion it is sufficient 
to know the spatial components h, to the accuracy (~ 1/c?) with which they are given in 
(106.1); the mixed components (which are absent in the 1/c? approximation) are needed to 
terms of order 1/c?, and the time component hog to terms in 1/c*. To calculate them we turn 
once again to the general equations of gravitation, and consider the terms of corresponding 
order in these equations. 

Disregarding the fact that the bodies are macroscopic, we must write the energy-momen- 
tum tensor of the matter in the form (33.4), (33.5). In curvilinear coordinates, this expression 
is rewritten as 


k 
— ó(r—r,) (106.4) 


[for the appearance of the factor IN —g, see the analogous transition in (90.4)]; the sum- 
mation extends over all the bodies in the system. 
The component 
m,c? dt 
Too — 3 Jz; 90 ds ó(r —r,) 


in first approximation (for galilean gx) is equal to $; m, c? ó(r— r,); in the next approxima- 


tion, we substitute for g; from (106.3) and find, after a simple computation: 


5 : 
Too = F m,c? (1 + x 4 zs) &(r—r,), (106.5) 


where v is the ordinary three-dimensional velocity (v* = dx*/dt) and 6$, is the potential 
of the field at the point r,. (As yet we pay no attention to the fact that $, contains an in- 
finite part—the potential of the self-field of the particle m,; concerning this, see below.) 

As regards the components Tag, Toa of the energy-momentum tensor, in this approxima- 
tion it is sufficient to keep for them only the first terms in the expansion of the expression 
(106.4) 


Tag = Y m, Vaa Vap é(r —r,), 
Toa =-=- Y m, CU gq Ó(T —r,). 


Next we proceed to compute the components of the tensor Ra. The calculation is con- 
veniently done using the formula Ry = g'"Ri,, with Ri, given by (92.1). Here we must 
remember that the quantities A, g and Aoo contain no terms of order lower than 1/c?, and hoa 


(106.6) 
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no terms lower than 1/c?; differentiation with respect to x? = ct raises the order of smallness 
of quantities by unity. 

The main terms in Rog are of order 1/c?; in addition to them we must also keep terms of 
the next non-vanishing order 1/c*. A ume computation gives the result: 


a 1 ht 1 2 
Roo — : es: — I ô Se) +5 +- Ahoot = l pe ô hoo _ (oy — 


c ôt \Ox* 2c ôt 2 Ox'ex? Ox" 
_ 1 Ohoo 5 Oh oh? 
4 0x XV Ox* axF J’ 
In this computation we have still not used any auxiliary condition for the quantities Aj. 
Making use of this freedom, we now impose the condition 
ôhg 10h, 
-~-—=0 . 
ôx” 2c ôt , (106.7) 


as a result of which all the terms containing the components Ao, drop out of Roo. In the 
remaining terms we substitute 


2 1 
he = ie, hoo = 32 $40 (5). 
c c 


and obtain, to the required accuracy, 


1 2 2 
Roo = 2 Aloo + a gAg—- dá (VA, (106.8) 


where we have gone over to three-dimensional notation. In computing the components 
Ro, it is sufficient to keep only the terms of the first nonvanishing order—1/c?. In similar 
fashion, we find: 


1 eihP i! Q^ hb 1 2h QI 1 LA 
9" 2cótüx 20x 0x! 2cót0x* loa 
and then, using the condition (106.7): 


1 1 9 
=-Al , 
Roa = 5 Ahoa + 53 graye (106.9) 
Using the expressions (106.5)-(106.9), we now write the Einstein equations 
8nk 1 
Ry = r (Ta- jar). (106.10) 


The time component of equation (106.10) gives: 


Ahoo+ - t pAb- AGO = =m, c (7 36s a +3) ora: 


making use of the identity 
AV)? = 2A(6?) -46^ó 
and the equation of the Newtonian potential 


Ad = 4nk Y m,ó(r—r,), (106.11) 
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we rewrite this equation in the form 


2 8k $; 3 
A (hoo 5 2) =F Em, (i$ +5) &(r —r,). (106.12) 
After completing all the computations, we have replaced 4, on the right side of (106.12) by 
, : m, 
pe -k5 DL. 
47 7L noni 


i.e. by the potential at the point r, of the field produced by all the bodies except for the 
body m,; the exclusion of the infinite self-potential of the bodies (in the method used 
by us, which regards the bodies as pointlike) corresponds to a “renormalization” of their 
masses, as a result of which they take on their true values, which take into account the field 


produced by the bodies themselves. t 
The solution of (106.12) can be given immediately, using the familiar relation (36.9) 


We thus find: 


lige te Se y l 
100 2 c c Ir-r, " b» r—r,| (106.13) 
The mixed component of equation (106.10) gives: 
l6xk | e$ 
Aho. = — EE 2 m, Vaa Ó(T—r,) — SAO (106.14) 
The solution of this linear equation is? 
4k — mv | os 
| zx a"ag — 7 7 J 
t= Lan se 
where f is the solution of the auxiliary equation 
km 
Af=¢@=-Y¥-—*. 
f=o 2; ex 
Using the relation Ar = 2/r, we find: 
k 
f=— => m,|r-r,|, 
25 
and then, after a simple computation, we finally obtain: 
k m, 
Nog = 2c 2: \r—r,| [o t (v, n)", ], (106.15) 


t Actually, if there is only one body at rest, the right side of the equation will have simply (8ztk/c? mjó(r-r,), 
and this equation will determine correctly (in second approximation) the field produced by the body. 

1 In the stationary case, the second term on the right of equation (106.14) is absent. At large distances 
from the system, its solution can be written immediately by analogy with the solution (44.3) of equation (43.4) 


2k 
hoa = er (M xn)a 


(where M = frxpvdV = Y mata X Ya is the angular momentum of the system), in agreement with formula 
(105.19). 
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where n, is a unit vector along the direction of the vector r—r,. 
The expressions (106.1), (106.13) and (106.15) are sufficient for computing the required 
Lagrangian to terms of second order. 
The Lagrangian for a single body, in a gravitational field produced by other bodies and 
assumed to be given, is 
L= —m 


ds pt v? viu? 1/2 
— 2 a a 
aC t = MaC (1 + hoo t 2hg, t — à th v s) . 


Expanding the square root and dropping the irrelevant constant —#1,c?, we rewrite this 
expression, to the required accuracy, as 


Mata | mU; 2 [hoo Va l "E ligo | hoo 2 
Lp + gee Ue + 


(106.16) 
Here the values of all the 4, are taken at the point r,; again we must drop terms which become 
infinite, which amounts to a “renormalization” of the mass m, appearing as a coefficient 
in L,. 

The further course of the calculations is the following. The total Lagrangian of the system 
is, of course, not equal to the sum of the Lagrangians L, for the individual bodies, but 
must be constructed so that it leads to the correct values of the forces f, acting on each of 
the bodies for a given motion of the others. For this purpose we compute the forces f, by 
differentiating the Lagrangian L,: 

(8) 
"o N ÔE 7r 


the differentiation is carried out with respect to the running coordinate r of the "field 
point" in the expressions for Aix). It is then easy to form the total Lagrangian L, from which 
all of the forces f, are obtained by taking the partial derivatives 0L/ér,. ` 

Omitting the simple intermediate computations, we give immediately the final result for 
the Lagrangian:t 


mvs ,km,m, 
Lapp St) +E Ee tly 
Emm k?m,m,m, 
-ED ge, [Maver Wome mI“ EEL nz 5, (106.17) 
a c ab' ac 


where ra = |r, —r;|, Da is a unit vector along the direction r,—r,, and the prime on the 
summation sign’ means that we should omit the term with b = a or c =a. 


PROBLEMS 


1. Find the action function for the gravitational field in the Newtonian approximation. 
Solution: Using the g;, from (106.3), we find from the general formula (93.3), G = (2/c*)(V@)’, 


so that the action for the field is 
1 2 
$,- — i Í Í (Vo)? dV dt. 


t The equations of motion corresponding to this Lagrangian were first obtained by A. Einstein, L. Infeld 
and B. Hoffmann (1938) and by A. Eddington and G. Clark (1938). 
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The total action, for the field plus the masses distributed in space with density y, is: 


s-[f E -no-z (V9? | dV dt. (1) 


One easily verifies that variation of S with respect to 9 gives the Poisson equation (99.2), as it 
should. 

The energy density is found from the Lagrangian density A [the integrand in (1)] by using the 
general formula (32.5), which reduces in the present case (because of the absence of time derivatives 
of gin A) to changing the signs of the second and third terms. Integrating the energy density over 
all space, where we substitute 49 = (1/4nk)g Ag in the second term and integrate by parts, we 
finally obtain the total energy of field plus matter in the form 


uv? 1 n 
Í k — Srk (Vo) | dV. 


Consequently the energy density of the gravitational field in the Newtonian theory is 
W = —(1/82k)(Vo)?.t 

2. Find the coordinates of the center of inertia of a system of gravitating bodies in the second 
approximation. 

Solution: In view of the complete formal analogy between Newton’s law for gravitational 
interaction and Coulomb’s law for electrostatic interaction, the coordinates of the center of inertia 
are given by the formula 


1 pi kma "mM 
= — 21.9. penis 
R pEr (mot Pe 2 2 


a , 
(Y(ne mE 


b lob 


which is analogous to the formula found in Problem 1 of $ 65. 


3. Find the secular shift of the perihelion of the orbit of two gravitating bodies of comparable 
mass (H. Robertson, 1938). 


Solution: The Lagrangian of the system of two bodies is 
y mri mri kmam, 
2 2 


l 4 4 
Sec Mr Tomos) 


km,m; 


XE [301 + 03) — 7v,- v. — (v, n)(v,:n)] — (DIT) m) 


2c°r? 
Going over to the Hamiltonian function and eliminating from it the motion of the center of inertia 
(see problem 2 in § 65), we get: 


ws P?(1 _1\_kmm_ pfi 1 
War lee a 


2 
= te Ea (= + =) +7p?+(p- al 4- k?my mim; + ms) (1) 
2 


1 2c?r? i 


where p is the momentum of the relative motion. 

We determine the radial component of momentum p, as a function of the variable r and the 
parameters M (the angular momentum) and 4 (the energy). This function is determined from the 
equation 3 = 4 (in which, in the second-order terms, we must replace p? by its expression from 
the zero'th approximation): 


T To avoid any misunderstanding, we state that this expression is not the same as the component (—g) foo 


of the energy-momentum pseudotensor (as calculated with the gx from (106.3)); there is also a contribution 
to W from (—g) Tx. 
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1/1 1 M? km,m 1 1 1 2m; m; V? k 2 
emp ee ae Sf pee yc Uu ef E rcd [ems RT. 
stu) c s) r 8c? (+3) (zm) (4+ r ) 


oo E (zz) +] 2m; ms («+ x) ok ia k?m, mom, Ema 


m; m m; mas r 2cr^* 2c?r? 


The further course of the computations is analogous to that used in $ 98. Having determined p, 
from the algebraic equation given above, we make a transformation of the variable r in the integral 


S, - f pdr, 


so that the term containing M? is brought to the form M ?/r?, Then expanding the expression under 
the square root in terms of the small relativistic corrections, we obtain: 


BRB 417 ELN g 
s= [as Z (me 5 2) ig 
r [^i r 


[see (101.6)], where A and B are constant coefficients whose explicit computation is not necessary. 

As a result we find for the shift in the perihelion of the orbit of the relative motion: 
õp = 6nk?m? mi m 6nk(m; + mz) 

cM? ca(1—e?) ` 
Comparing with (101.7) we see that for given dimensions and shape of the orbit, the shift in the 
perihelion will be the same as it would be for the motion of one body in the field of a fixed center of 
mass m; - 7n. 

4. Determine the frequency of precession of a spherical top, performing an orbital motion in the 
gravitational field of a central body that is rotating about its axis. 

Solution: In the first approximation the effect is the sum of two independent parts, one of which 
is related to the non-Newtonian character of the centrally symmetric field (H. Weyl, 1923) and the 
other to the rotation of the central body (L. Schiff, 1960). 

The first part is described by an additional term in the Lagrangian of the top, corresponding to 
the second term in (106.17). We write the velocities of individual elements of the top (with mass 
dm) in the form v = V-- xr, where V is the velocity of the orbital motion, € is the angular velocity, 
and r is the radius vector of the element dm relative to the center of the top (so that the integral over 
the volume of the top f r dm — 0). Dropping terms independent of œ and also neglecting terms 
quadratic in @, we have: 


SDL = 


3km Ja k am. 


2c? R 
where m is the mass of the central body, R =|Ro+r| is the distance from the center of the field 
to the element dm, Ro is the radius vector of the center of inertia of the top. In the expansion 


L/R z V Ro —(n:r/ R2) (where n = Ro/ Ro) the integral of the first term vanishes, while integration 
of the second term is done using the formula 


Í Xa Xs dm = Mói;; 


where / is the moment of inertia of the top. As a result we gct: 


50L = 3km’ 
2c°R? 
where M = /q is the angular momentum of the top. 

The additional term in the Lagrangian, duc to the rotation of the central body, can also be found 

from (106.17), but it is even simpler to calculate it using formula (1) of the problem in § 105: 
2k | M (o... D x R 
where M’ is the angular momentum of the central body. Expanding. 
R n 


RE” R? 


M-v xn, 


! 
+ Ri (r - 3n(n -r)) 
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and performing the integration, we get: 
PL = EN (M:M'—3(n: M)n: M’)}. 
c? R$ 


Thus the total correction to the Lagrangian is 
3km' _ k 
28R n X Vor ZR? 
To this function there corresponds the equation of motion 
aM =QxM 
dt 


[see equation (2) of the problem in § 105]. This means that thc angular momentum M of the top 
precesses with angular velocity Q, remaining constant in magnitude. 


ôL = —M:Q, Q= (3n(n- M) -M'j. 


CHAPTER 13 


GRAVITATIONAL WAVES 


§ 107. Weak gravitational waves 


Just as in electrodynamics, in the relativistic theory of gravitation the finite velocity of 
propagation of interactions results in the possibility of the existence of free gravitational 
fields that are not linked to bodies—gravitational waves. 

We consider the weak gravitational field in vacuum. As in § 105, we introduce the tensor 
hix, describing a weak perturbation of the galilean metric: 


gu = Qu thi. (107.1) 
Then, to terms of first order in the A;,, the contravariant metric tensor is: 
g“ = gnh, (107.2) 


and the determinant of the tensor gix: 
g = g™(1 +h), (107.3) 


where Az hí; all operations of raising and lowering tensor indices are done with the unper- 
turbed metric g”. 

As already pointed out in § 105, the condition that the 4;, be small leaves the possibility 
of arbitrary transformations of reference system of the form x’! = x/-- £, with small £'; 
then 

6g; ô 
DN en eed (107.4) 
Ox" Ox 
Using this arbitrariness of gauge for the tensor A;,, we impose on it the supplementary 
condition 
aw 
A m die h- goth, (107.5) 
x 


after which the Ricci tensor takes the simple form (105.11): 


Ry = ih. (107.6) 
where “7 denotes the d'Alembertian operator: 
1 ĉ? 
ed ek, im(0)A2. e 165 m. il. bll 3 
C g iCX CX age eT 


The conditions (107.5) still do not fix a unique choice of reference frame: if certain A; 


345 
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satisfy these conditions, then so will the A;, of (107.4), if only the £' are solutions of the 
equations ; 
De = 0. (107.7) 
Equating (107.6) to zero, we thus find the equations for the gravitational field in vacuum 

in the form 
Dh; = 0. (107.8) 


This is the ordinary wave equation. Thus gravitational fields, like electromagnetic fields, 
propagate in vacuum with the velocity of light. 

Let us consider a plane gravitational wave. In such a wave the field changes only along 
one direction in space; for this direction we choose the axis x! = x. Equation (107.8) then 


changes to 
e? 10\, 
(za z =) hj = 0, (107.9) 
the solution of which is any function of t+ x/c (§ 47). 

Consider a wave propagating in the positive direction along the x axis. Then all the quan- 
tities h% are functions of t—x/c. The auxiliary condition (107.5) in this case gives 
V1 —? =0, where the dot denotes differentiation with respect to t. This equality can be 
integrated by simply dropping the sign of differentiation—the integration constants can 
be set equal to zero since we are here interested only (as in the case of electromagnetic 
waves) in the varying part of the field. Thus, among the components w* that are left, we 
have the relations 


Wie, =y y=  wo= ve: (107.10) 


As we pointed out, the conditions (107.5) still do not determine the system of reference 
uniquely. We can still subject the coordinates to a transformation of the form 
x'! = x'+é(t—x/c). These transformations can be employed to make the four quantities 

9, WS, WS, Vi -- 3 vanish; from the equalities (107.10) it then follows that the components 
V!, V1, V1, Wo also vanish. As for the remaining quantities y3, V2— v3, they cannot be 
made to vanish by any choice of reference system since, as we see from (107.4), these com- 
ponents do not change under a transformation £; = £(t— x/c). We note that y = Wi also 
vanishes, and therefore wt = hi. 

Thus a plane gravitational wave is determined by two quantities, A3, and h33 = — A33. 
In other words, gravitational waves are transverse waves whose polarization is determined 
by a symmetric tensor of the second rank in the yz plane, the sum of whose diagonal terms. 
hy. - 533, IS zero. 

For the two independent polarizations we may choose the cases in which one of the two 
quantities A3, and 1(/;; — A33) differs from zero. These two polarizations are distinguished 
from one another by a rotation through z/4 in the yz plane. 

Let us calculate the energy- momentum pseudotensor in a plane gravitational wave. 
The components 1^ are second-order quantities; we must calculate them neglecting terms 
of still higher order. Since, when A = 0, the determinant g differs from g‘°’ = — | only by 
terms of second order, we can, in the general formula (96.9), set g^, xg, x — ^ ,. For 
a plane wave all the other nonzero terms in // are contained in the term 


41g" OG 99", 975, = MT hi^ 


in curly brackets in (96.9) (as is easily shown by choosing one of the axes of a galilean 
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system of reference along the direction of propagation of the wave). Thus, 


c* 


DO 0 RM, 107.11 

32nk * ( ) 

The energy flux in the wave is given by the quantities —cgt9? z:ci9*. In a plane wave, 

propagating along the x! axis, in which the nonzero quantities ha, and A,, = —hy, 
depend only on the difference 1 — x/c, this flux is also along x! and is equal to 


3 
t 
ct?! Pe 


ara! [h33+3(hz2—h33)*]. (107.12) 


As initial conditions for the arbitrary field of a gravitational wave we must assign four 
arbitrary functions of the coordinates: because of the transversality of the field there are 
just two independent components of A,,, in addition to which we must also assign their 
first time derivatives. Although we have made this enumeration here by starting from the 
properties of a weak gravitational field, it is clear that the result, the number 4, cannot be 
related to this assumption and applies for any free gravitational field, i.e. for any field which 
is not associated with gravitating masses. 


PROBLEMS 


Determine the curvature tensor in a weak plane gravitational wave. 
Solution: Calculating R from (105.8), we find the following nonzero components: 


iklm 
— Rozo2 = Rosos = — Riaiz = Rozi? = Rossi = Raia = 6, 
Roszos = — Ri231 = —Rosi2 = Rozaa = #, 
where we use the notation 
g= — bhas = Viaz, u= — Vias. 


In terms of the three-dimensional tensors A,, and B,, in (92.15), we have: 


0 0 0 00 0 
Aa={90 —oe a], B,—-[0 u oc 
0 u c 0 c —u 


By a suitable rotation of the x?, x? axes, we can make one of the quantities o or 4 vanish (at a 
given point of four-space); if we make c vanish in this way, we reduce the curvature tensor to the 
degenerate Petrov type II (type N). 


$ 108. Gravitational waves in curved space-time 


Just as we have treated the propagation of gravitational waves "on the background" 
of a flat space-time, we can consider weak perturbations relative to an arbitrary (non- 
galilean) “unperturbed” metric gí?)" Also anticipating other possible applications, we 
shall write the necessary formulas in a more general form. 

Again taking the g,, in the form (107.1), we find the first order correction to the Christoffel 
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symbols expressed in terms of the hy: 
Di = AL hia — hu). (108.1) 


which can be verified by direct calculation (here, as in the sequel, all tensor operations of 
raising and lowering of indices, and covariant differentiation, are done with the nongalilean 


metric g{°’). We find for the corrections to the curvature tensor: 
R*D = Mua + LP — haa — has — him + haim) (108.2) 
The corrections to the Ricci tensor are then 
RYO, = ROS, = HUM + hii — hy), — ha). (108.3) 


The corrections to the mixed components of the Ricci tensor are obtained from the relations 


KO : 
RM " + RD, = (RO, + RP Ag — h), 


so that 


RD), = g" RU _ hE RO (108.4) 

The exact metric in vacuum must satisfy the exact Einstein equations Aj, = 0. Since the 

unperturbed metric g{° satisfies the equations R'9, = 0, we find for the perturbation, 
RO. = 0,1.e., 


hi ua iaa haa hug = 0. (108.5) 
In the general case of arbitrary gravitational waves, simplification of this equation to a 
form like (107.8) is not possible. This can, however, be done in the important case of waves 
of high frequency: when the wavelength 4 and the oscillation period A/c are small compared 
to the characteristic distances L and times L/c over which the “background field" changes. 
Each differentiation of a component 4,, increases the order of the quantity by a factor L/A 
relative to derivatives of the unperturbed metric g‘?’. If we limit the accuracy to terms of the 
two highest orders [(£/2)? and (L/A)] we can interchange the orders of differentiation; 
in fact, the difference 


I i l pmí(0) m pl(0) 
higa — hiir m BQRUU4, — hDR'Ü mi 


is of order (L/A)°, whereas each of the expressions hia and NM contains terms of both 
higher orders. imposing on A, the supplementary conditions 
ix = 0 (108:6) 
[analogous to (107.5)], we get the equation 
ha, = 0 (108.7) 
which generalizes (107.8). 

For the reasons given in 8 107, the condition (108.6) does not fix a unique choice of coordi- 
nates. They can still be subjected to a transformation x’! = x'-- £', where the small quanti- 
ties &/ satisfy the equation £^*, = 0. These transformations can be used, in particular, 
to impose on the A, the condition ^ai = 0. Then y* = Ak, so that the /* are subjected 
to conditions 

ht, —-0, h «0. (108.8) 


After this the set of admissible transformations is reduced to the requirement é',, = 0. 
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The pseudotensor r contains, in addition to the unperturbed part r“”’, terms of various 
orders in the A; We arrive at an expression analogous to (107.11) if we consider the quanti- 
ties r* averaged over regions of four-space with dimensions large compared to 4 but small 
compared to L. Such an averaging (which we denote by the angular brackets <... >) does 
not affect the gf?! and annihilates all quantities that are linear in the rapidly oscillating 
quantities /r,. Of the quadratic terms, we preserve only the terms of higher (second) order 
in I/A; these are the terms quadratic in the derivatives h,, s Oh, /Ox'. 

To this accuracy, all terms in r’* that are expressed as four-divergences can be dropped. 
In fact, the integrals of such quantities over a region of four-space (the region of averaging) 
are transformed by Gauss’ theorem, as a result of which their order of magnitude in 1/4 
is reduced by unity. [n addition, those terms drop out which vanish because of (108.7) 
and (108.8) after integration by parts. Thus, integrating by parts and dropping integrals 
of four-divergences, we find: 


<h” hp = —<h'"hP, > = 0, 
Che „hiy = —( hi, = 0. 
As a result the only second-order terms that remain are 


c* 


32nk 


qM = (i h^». (108.9) 
We note that to this same accuracy, (ri? »? = 0. 

Since it has a definite energy, the gravitational wave is itself the source of some additional 
gravitational field. Like the energy producing it, this field is a second-order effect in the Aj,. 
But in the case of high-frequency gravitational waves the effect is significantly strengthened: 
. the fact that the pseudotensor 1 is quadratic in the derivatives of the Aj, introduces 
the large factor 4^ ?. [n such a case we may say that the wave itself produces the background 
field on which it propagates. This field is conveniently treated by carrying out the averaging 
described above over regions of four-space with dimensions large compared to À. Such 
an averaging smooths out the short-wave "'ripple" and leaves the slowly varying back- 
ground metric (R. A. Isaacson, 1968). 

To derive the equation determining this metric, we must, in expanding the A, keep 
not only linear terms but also quadratic terms in A: Rj, = RP + RY + Ri’. As already 
pointed out, the averaging does not affect the zero-order terms. Thus, the averaged field 
equations (R;,> = O take the form 


RP = QR. (108.10) 


where we should keep only terms of second-order in 1/2 in R’. They are easily found from 
the identity (96.7). The terms quadratic in A; that arise on the right side of this identity, 
and have the form of a four-divergence, vanish (to the accuracy considered) when the 
averaging is done, and there remains 


8xk |. 
(R* —Ag* RJ?» = -— (pO, 
[4 


or, since (1/9?) = 0, to this same accuracy: 


8nk 
(P = — ae d 
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Finally, using (108.9), we get eq. (108.10) in the final form 
Rip? = Aha. (108.11) 


If the "background" is produced entirely by the waves themselves, (108.11) and (108.7) 
must be solved simultaneously. An estimate of the expressions on both sides of (108.11) 
shows that in this case the radius of curvature of the background metric, which is of order 
L, is related to the wavelength å and the order of magnitude of its field h by L^? ~A?/2?, 
i.e. A/L~h. 


$ 109. Strong gravitational waves 


In this section we shall consider the solution of the Einstein equations which is a general- 
ization of the weak, plane gravitational wave in a flat space-time (I. Robinson and H. Bondi, 
1957). 

We shall look for a solution in which, in a suitable reference frame, all the components 
of the metric tensor are functions of a single variable, which we call x? (without, however, 
prejudging its character). This condition still permits coordinate transformations of the 
form 

x*—x* +4 $* (x9), (109.1) 
x? 5 $*(x?), (109.2) 
where $, $* are arbitrary functions. 

The character of the solution depends essentially on whether we can make all the go, 
vanish by using the three transformations (109.1). This can be done if the determinant 


lJagl #0. In fact, under the transformation (109.1), Gor9 oat Gash? (where the dot denotes 
differentiation with respect to x°); if |g,5| #0, the system of equations 


Gout Gap? = 0 
determines the $^(x?) that accomplish the required transformation. Such a case will be 
treated in $ 117; here we shall be interested in the solution in which 
Gap = 0. (109.3) 


In this case there is no reference system in which all the go, = 0. Instead, however, the 
four transformations (109.1-2) can be used to make 


goi = 1, goo = Gor = Joz = O. (109.4) 


Here the variable x? has “‘lightlike’” character: for dx* = 0, dx? #0, the interval ds = 0; 
we shall denote the variable x? chosen in this way by x? = n. Under the conditions (109.4) 
the line element can be written in the form 


ds? = 2dx! dg-- g (dx? - g^dx )(dx* &- g*ux!). (109.5) 


Throughout this section, the indices a, b, c, . . . take on values 2, 3; g,,(n) can be regarded 
as a two-dimensional tensor. Calculation of the quantities R,, leads to the following field 
equations: 


Ra, = —394.9 9:40" = 0. 
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It then follows that ga. g^ = 0, or g° = 0, i.e. g^ = const. We can, therefore, by a transforma- 
tion x^ 4 g^x'— x^, bring the metric to the form 


ds? = 2dx! dn+gas(n)dx%dx°. (109.6) 


The determinant —g of this metric tensor coincides with the determinant |g,,|, while 
the only nonzero Christoffel symbols are the following: 
bo = dk Das —dka. 
where we have introduced the two-dimensional tensor Kas = Jab, KÈ = gx... Of all the 
components of the Ricci tensor, the only one that does not vanish identically is Roo, so that 
we have the equation 


Roo = —ik$—- iki = 0. (109.7) 


Thus, the three functions g55U]). g23(4), 933(4) must satisfy just one equation. Therefore 
two of them can be chosen arbitrarily. It is convenient to write (109.7) in another form 
writing the g,, in the form 


da = —XYYa [Yal = I. (109.8) 


Then the determinant —g = |g,,| = x*, and substitution in (109.7) gives, after simple 
transformations, 


+H Yay” Viva )x = 0 (109.9) 


(y is the two-dimensional tensor reciprocal to ya). If we assign arbitrary functions y,,(7) 
(related to one another through the relation |y,,| = 1) these equations determine the 
function x(n). 

We thus arrive at a solution containing two arbitrary functions. It is easy to see that it is 
a generalization of the case considered in § 107 of a weak plane gravitational wave pro- 
pagating in one direction.t The latter is obtained if we make the transformation 


t+x | t=x 
n = ——, X = —— 
V2 V2 
and set Yas = Og, + Aan) (where the h, are small quantities, subject to the condition 
hy» +h33 = 0) and y = l; a constant value of y satisfies (109.9) if we neglect small second- 


order terms. 

Suppose that a weak gravitational wave of finite extent (a “wave packet”) is passing 
some point x. At the beginning of the passage we have A, = 0, y = l; at the end of the 
passage we again have ha = 0, 0? y/Ot? = 0, but the inclusion of second order terms in 
(109.9) leads to the appearance of a nonzero negative value of 0y/dr: 


1 [ha 
nies — |( x) dt «0 


(the integral is taken over the time of passage of the wave). Thus, after the wave has passed, 
y = | —const t, and after a finite time interval, y changes its sign. But vanishing of y means 
vanishing of the metric determinant g, i.e. a singularity in the metric. This singularity, 
however, is not physical in character; it is related only to the unsatisfactory nature of the 


t A solution of similar character in a larger number of variables is given in I. Robinson and A. Trautman, 
Phys. Rev. Lett. 4, 431 (1960); Proc. Roy. Soc. A 265, 463 (1962). 
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reference frame, "spoiled" by the passing gravitational wave, and can be eliminated by a 
suitable transformation; after passage of the gravitational wave, the space-time does 
actually become flat again. 

This can be shown directly. If we measure the variable 5 from its value corresponding to 
the singular point, y = n, so that 


ds? = 2dn dx! —n?[(dx?)? -- (dx?y']. 


After the transformation 


we get 
ds? = 2dn dé—dy* —dz?, 


and the substitution y = (t+ x)/ 5. č = (t— x)/ 42 finally brings the metric to galilean 
form. 

This property of the gravitational wave—the creation of a fictitious singularity, is, of 
course, not related to the fact that the wave is weak; it also applies to the general solution 
of (109.7); just as in the example considered, near the singularity y~n, i.e. —g~n*.+ 


PROBLEM 
Find the condition for a metric of the form 
ds? = dt? -dx? dy? —dz? -f(t —x, y, z)(dt—dx)* 


to be an exact solution of the Einstein equations for a field in vacuum (A. Peres, 1960). 
Solution: The Ricci tensor is calculated most simply in the coordinates u = (f—x) 4/2, v = 
= (t+x)//2, y, z, in which 


ds? = —dy? —dz? +2dudv +2flu, y, z)dw?. 


Aside from g22 = gss = —1, the only nonzero components of the metric tensor are g,, = 2f, 
g,, = l; then g" = —2f, g = 1, while the determinant g = —1. A direct calculation with 
(92.1) gives for the nonzero components of the curvature tensor: 
E e e 
E NE CM 
yuyu Oy? Zuzu 02? yuzu Oydz 


The only nonzero component of the Ricci tensor is R,, = Af, where A is the Laplacian in the 
coordinates y, z. Thus the Einstein equation is Af = 0, i.e. the function f(t —x, y, z) must be harmo- 
nic in the variables y, z. 

If f is independent of y and z, or linear in them, there is no field—the space time is flat (the 
curvature tensor vanishes). The function f(u, y, z) = yzfilu) + Hy? —27)f2(u), which is quadratic 
in y and z, corresponds to a plane wave propagating in the positive x direction; the curvature 
tensor in such a field depends only on t-x: 


Ryu = — fiu), Ry ST Rasa x — f(u). 


Corresponding to the two possible polarizat‘ons of the wave, the metric contains two arbitrary 
functions fi(u) and f;(u). 


t This can be shown using (109.7) in precisely the same way as in $ 97 for the analogous three-dimensional 
equation in the synchronous reference frame. Just as there, the appearance of a fictitious singularity is 
reiated to the crossing of coordinate curves. 
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§ 110. Radiation of gravitational waves 


Let us consider next a weak gravitational field, produced by arbitrary bodies, moving with 
velocities small compared with the velocity of light. 

Because of the presence of matter, the equations of the gravitational field will differ from 
the simple wave equation of the form [3A* = 0 (107.8) by having, on the right side of the 
equality, terms coming from the energy-momentum tensor of the matter. We write these 
equations in the form 


1 Szk 
5 Oi "X (110.1) 


where we have introduced in place of the h* the more convenient quantities 
yi = hi — 40th, 


and where 15 denotes the auxiliary quantities which are obtained upon going over from the 
cxact equations of gravitation to the case of a weak field in the approximation we are con- 
sidering. It is easy to verify that the components t) and 1? are obtained directly from the 
corresponding components T* by taking out from them the terms of the order of magnitude 
in which we are interested ; as for the components 14, they contain along with terms obtained 
from the T7, also terms of second order from R— 4ójR. t 

The quantities ij^ satisfy the condition (107.5) 0Jf/Ox* = 0. From (110.1) it follows that 
this same equation holds for the ti: 


k 
—=0, 110.2) 


This equation here replaces the general relation T£ , = 0. 

Using the equations which we have obtained, let us consider the problem of the energy 
radiated by moving bodies in the form of gravitational waves. The solution of this problem 
requires the determination of the gravitational field in the “wave zone”, i.e. at distances 
large compared with the wavelength of the radiated waves. 

In principle, all the calculations are completely analogous to those which we carried out 
for electromagnetic waves. Equation (110.1) for a weak gravitational field coincides in 
form with the equation of the retarded potentials ($ 62). Therefore we can immediately 
write its general solution in the form 


- [ena I 


Since the velocities of all the bodies in the system are small, we can write, for the field 
at large distances from the system (see §§ 66 and 67), 


y=- B i; | bte ar, (110.4) 
0 


+ From eqs. (110.1) we can again obtain the formulas (106.1-2) that were used in § 106 for the weak 
constant field far from bodies. In the first approximation we neglect terms with second time derivatives 
(containing | /¢?). and of all the components of T . only 7$ = uc? remains. The solution of the equations 
M = 0, M, = 0. Ade = l6rk uic? that vanishes at infinity is = =0. y = 0, 4 = 4élc, where ¢ is the 
Newtonian gravitational potential; cf. (99.2). One then finds for the tensor k = y —MWó: the values 
(106.1-2). 
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where Rọ is the distance from the origin, chosen anywhere in the interior of the system. 
From now on we shall, for brevity, omit the index t — (Ro/c) in the integrand. 

For the evaluation of these integrals we use equation (110.2). Dropping the index on the 
t% and separating space and time components, we write (110.2) in the form 


Ôtay _ Ĉteo Lo, foy _ Oran 
Ox’ Ox? : ax’? Ox? 


= 0. (110.5) 


Multiplying the first equation by x^, we integrate over all space, 


B 
zs i Tj 0x8 dV = IE xf dV = [5 dV — Í Tag dV. 


Since at infinity c, = 0, the first integral on the right, after transformation by Gauss’ 
theorem, vanishes. Taking half the sum of the remaining equation and the same equation 
with transposed indices, we find 


i dV = — T ss | Ce t tgo X^) dV. 


Next, we multiply the second equation of (110.5) by x*x*, and again integrate over all 
space. An analogous transformation leads to 


ð 
F sas | toor xP dV = - | (ox! +T pox") dV. 


Comparing the two results, we find 


2 6x3 


Thus the integrals of all the t, g appear as expressions in terms of integrals containing only 
the component Too. But this component, as was shown earlier, is simply equal to the corres- 
ponding component Too of the energy-momentum tensor and can be written to sufficient 
accuracy [see (99.1)] as: 


12? 
f: Tap dV = Too x"xf dV. (110.6) 


too = u£?. (110.7) 
Substituting this in (110.6) and introducing the time t = x°/c, we find for (110.4) 


2k ô? 
c^R, at? 


At large distances from the bodies, we can consider the waves as plane (over not too large 
regions of space). Therefore we can calculate the flux of energy radiated by the system, say 
along the direction of the x! axis, by using formula (107.12). In this formula there enter the 


components /i24 = W23 and /35—A33 = Y22— 33. From (110.8), we find for them the 
expressionst 


Yag = — 5 | uxt x? dV. (110.8) 


x 2k 8 x. 
—— D, hj,—h44 = — —— (D;,—D ; 
Q0 22733 3c R; (D22— D33) (110.9) 


t The tensor (110.8) does not satisfy the conditions under which formula (107.12) was derived. However, 
the transformation of reference frame that brings the A,, to the required gauge does not affect the values 
of the components of (110.9) that are used here. 
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(the dot denotes time differentiation), where we have introduced the mass quadrupole 
tensor (99.8): 


Dag = f u(3x*x* ~ 5,57?) dV. (110.10) 
As a result, we obtain the energy flux along the x! axis in the form 
k B2.-D33\’ , = 
t9 = — | ( ** ) + D3, |. (110.11) 
"= 36nc8R2 ( 2 ) K 3 


The flux of energy into an element of solid angle in the given direction is then obtained 
by multiplying by R2 do. 

The two terms in this expression correspond to the radiation of waves of two independent 
polarizations. To write them in invariant form (independent of the choice of the direction 
of radiation) we introduce the three-dimensional unit polarization tensor e,, of the plane 
gravitational wave, which determines the nonzero components of h„ș (in the gauge for the 
hj, in which ho, = hog = A = 0). The polarization tensor is symmetric and satisfies the 
conditions 

Car = 0, eng = O, ege. = I, (110.12) 
where n is a unit vector in the direction of propagation of the wave. 

Using this tensor we can write the intensity of radiation of a given polarization into 
solid angle do in the form 
NEP 

This expression depends implicitly on the direction of n through the transversality con- 
dition e, = 0. The total angular distribution for all polarizations is gotten by summing 
(110.13) over polarizations, or, what is equivalent, averaging over polarization and multiply- 
ing by 2 (the number of independent polarizations). The averaging is done using the formula 


dl (Daplap)? do. (110.13) 


Epl = din,ngn,ns (n,n5Ó,,  n,n4Ó,5) — 
— (n,n,Óg, + ngn,Ó,4 - n,n5Ós, + ngn4,,) — 
— 648045 t (0,,0p5 + 98,505) (110.14) 


(the expression on the right is a tensor formed from the unit tensor and the components of 
the vector n; it has the required symmetry in its indices, it. gives unity on contraction on 
pairs of indices x, y and fj, ô, and vanishes after scalar multiplication with n). 

The result is 


k lo in "RP 
~ 36nc) li (Day n, ng + 5 Dag—DapDayns | do. (110.15) 


The total radiation in all directions, i.e., the energy loss of the system per unit time 
( — dé/|dt), can be found by averaging d//do over all directions and multiplying the result 
by 4n. The averaging is easily performed using the formulas given in the footnote on p. 189, 
and gives 


dl 


- — =; D. 110.16 
dt 45-5 " ) 
We note that the radiation of gravitational waves is a fifth order effect in l/c. This fact, 

together with the smallness of the gravitational constant k, makes the usual effects extremely 

small. 
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PROBLEMS 


1. Two bodies, attracting each other according to Newton's law, move in circular orbits (around 
their common center of inertia). Determine the average (over a rotation period) of the intensity 
of radiation of gravitational waves and its distribution in polarization and direction. 

Solution: Choosing the coordinate origin at the center of inertia, we have for the radius vectors 
of the two bodies: 

Ha m, 


T; = T, T: : 
m,-cm; m,m; 


T, T —T,-—I;. 


The components of the tensor D,, are (if the xy plane coincides with the plane of motion): 


D, = ur'(3cos? p - 1, D,, = ur(3 sin? v —1), 


xx y 


D., = 3ur? cos y sin y, D,, = —pur?, 


where u = m™,m2/(m,+mz), v is the polar angle of the vector r in the xy plane. For circular motion 
r = const, and y =r-?Vk(m, +m) zo. ' 

We assign the direction of n by the polar angle 9 and azimuth ¢, with the polar axis z perpendi- 
cular to the plane of the motion. Let us consider the two polarizations for which: (1) e,, = 1/ v2; 
(2) eoo = —e,, = 1/v2. Projecting the tensor D,, on the directions of the spherical unit vectors 
e, and e,, calculating with formula (110.13) and averaging over the time, we find the result for 
these two cases and for the sum / = /, +/2: 


dl; |. kp 206, 


di, — ku^wer* 3 
don 0, — — —— (1 28) 
do 2nc* 4 cost 0, do 2ac* RUE 


di —Okuofr* 
do — 2n’ 
and after integrating over all directions: 
l dé -= 32ku'e?r* 32k*m?mi(m, +m?) li 5 
dr 5c? Secor? ^ fi 7 


(1 +6 cos? 6 +cos* 0), 


[for calculating the total intensity / alone, we should, of course, have used (110.16)]. 
The loss of energy from the radiating system leads to a gradual (secular) approach of the two 


bodies. Since € = —km,m;/2r, the velocity of approach is 
p= 2r dé _ | OA? m mi(m, +m2) 
km,m; dt Secor? ` 


2. Find the average (over a rotation period) of the energy radiated in the form of gravitational 
waves by a system of two bodies moving in elliptical orbits (P. C. Peters and J. Mathews).t 
Solution: In contrast to the case of circular motion, the distance r and the angular velocity vary 
along the orbit according to the laws 
a(l —e?) dy 


! 
; = ] +e cos v, dr — p [km +m;)a(t —e?)}t, 


where e is the eccentricity and a is the semimajor axis of the orbit (cf Mechanics, § 15). A quite 

lengthy calculation using (110.16) gives: 
dó = 8k*milmil(m, +m2) 
dt 15a5c*(1 Ze?) 


(1 +e cos y)* [1201 +e cos y)? +e? sin? y]. 


In averaging over the period of rotation, the integration over t is replaced by integration over y, 


1 For the angular, polarization, and spectral distributions of this radiation, cf. PAys. Rev. 131, 435 (1963). 
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and gives the result: 


_dé | 32k*mimiUn, rma) l (+5 1 
di 5cta* 0 =e?) EL z e) 


We note the rapid increase in intensity of radiation with increasing eccentricity of the orbit. 

3. Determine the time-averaged rate of loss of angular momentum from a system of bodies in 
stationary motion and emitting gravitational waves. 

Solution: For convenience of writing formulas, we temporarily regard the body as consisting 
of discrete particles. We represent the average rate of loss of energy of the system as the work of 
the “frictional forces" f acting on the particles: 

-dE or 
ü = Xf-v a) 


(we omit the index labeling the particles). Then the average rate of loss of angular momentum is 
given by 
dr = Lr xf), = Le. xf, (2) 


(cf. the derivation of formula (75.7)). To determine f, we write 


dé k ay k 4 
m lI =- (v) 
dt 45c5 D. D., 45c$ D. D; 

(where we have used the fact that the average values of total time derivatives vanish). Substituting 


D,, = Xm(3x,v,3x,u, - 2t: v9,,) and comparing with (1), we find: 


Substitution of this expression in (2) gives the result: 


iM, 2k ko 
TW 7 $e e," D54D,, 7 456 €» Pps Dor (3) 
4. Fora system of two bodies moving in elliptical orbits, find the average loss of angular momen- 


tum per unit time. 
Solution: A calculation with formula (3) of the preceding problem, analogous to that done 
in problem 2, gives the result: 


dM, B 32k? mimi V m, mi 1 ;( 7 
d 5cia* d —e?)? 3I 


For circular motion (e = 0) the values of & and M are, as they should be, related by & = Mo. 


CHAPTER 14 


RELATIVISTIC COSMOLOGY 


§ 111. Isotropic space 


The general theory of relativity opens new avenues of approach to the solution of prob- 
lems related to the properties of the universe on a cosmic scale. The new remarkable 
possibilities which arise are related to the non-galilean nature of space-time (first noted by 
Einstein in 1917). 

Before proceeding to a systematic construction of relativistic cosmological models, 
we make the following comment concerning the basic field equations from which we start. 

The requirements formulated in § 93 as conditions for determining the action for the 
gravitational field will still be satisfied if we add a constant term to the scalar G, i.e. if we set 

3 
S, = ——— | (G+2A)/—g dQ, 
o = — Tonk f S 
where A is a new constant (with dimensions cm ?). Such a change leads to the appearance 
in the Einstein equations of an auxiliary term Ag 4: 


8nk 
Ry —4RGn = P" T, + Agi. 


If one ascribes a small value to the “cosmological constant" A, the presence of this term 
will not significantly affect gravitational waves over not too large regions of space-time, 
but it will lead to the appearance of new types of “cosmological solutions" which could 
describe the universe as a whole.t At the present time, however, there are no cogent and 
convincing reasons, observational or theoretical, for such a change in the form of the 
fundamental equations of the theory. We emphasize that we are talking about changes that 
have a profound physical significance: introducing into the Lagrange density a constant 
term which is generally independent of the state of the field would mean that we ascribe 
to space-time a curvature which cannot be eliminated in principle and is not associated 
with either matter or gravitational waves. All of the remaining presentation in this section 
is therefore based on the Einstein equations in their classical form without the “cosmological 
constant". 

We know that the stars are distributed over space in a very nonuniform manner; they 
are concentrated into separate star systems (galaxies). But in studying the universe on a 
"Jarge scale" one should disregard "local" inhomogeneities produced by the condensation 


1 In particular, one can now have stationary solutions which do not occur for .\ = 0. It was precisely 
for this reason that the “cosmological term" was introduced by Einstein before the discovery by Friedmann 
of nonstationary solutions of the field equations—cf. below. 
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of matter into stars and star systems. Thus, by the mass density we should understand the 
density averaged over regions of space whose dimensions are large compared to the distances 
between galaxies. 

The solutions of the Einstein equations considered later (in $8 111-114), the so-called 
isotropic cosmological model (first discovered by A. A. Friedmann in 1922), are based on 
the assumption of homogeneity and isotropy of the distribution of matter in space. Existing 
astronomical data do not contradict such an assumption,+ and at present there is every 
reason to believe that in general terms the isotropic model gives an adequate description 
not only of the present state of the universe but also of a significant part of its evolution 
in the past. We shall see below that the main feature of this model is its nonstationarity. 
There is no doubt that this property ("the expanding universe") gives a correct explanation 
of the phenomenon of the red shift, which is fundamental for the cosmological problem 
(8 114). 

At the same time it is clear that. by its very nature, the assumption of homogeneity and 
isotropy of the universe can have only an approximate character, since these properties 
surely are not valid if we go to a smaller scale. We shall turn to the question of the possible 
role of inhomogeneities of the universe in various aspects of the cosmological problem 
in $8 115-119. 

The homogeneity and isotropy of space mean that we can choose a world time so that 
at each moment the metric of the space is the same at all points and in all directions. 

First we take up the study of the metric of the isotropic space as such, disregarding for 
the moment any possible time dependence. As we did previously, we denote the three- 
dimensional metric tensor by y,,, i.e. we write the element of spatial distance in the form 


dU = yy, dx" dx. (LLL) 


The curvature of the space is completely determined by its three-dimensional curvature 
tensor, which we shall denote by P$, in distinction to the four-dimensional tensor Riim- 
In the case of complete isotropy, the tensor P5,, must clearly be expressible in terms of 
the metric tensor y,, alone. lt is easy to see from the symmetry properties of P5,, that it 
must have the form: 


Pragya = Ia 95 — 7367 sp) (111.2) 
where 2 is some constant. The Ricci tensor P,y = Pi, is accordingly equal to 


Pag = 2Àyag (111.3) 
and the scalar curvature 
P = 64. (111.4) 


Thus the curvature properties of an isotropic space are determined by just one constant. 
Corresponding to this there are altogether three different possible cases for the spatial 
metric: (1) the so-called space of constant positive curvature (corresponding to a positive 
value of A), (2) space of constant negative curvature (corresponding to values of 4«0), 
and (3) space with zero curvature (A = 0). Of these, the last will be a flat, i.e. euclidean, 
space. 


+ We have in mind data on the distribution of galaxies in space and on the isotropy of the so-called 
background radio radiation. 
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To investigate the metric it is convenient to start from geometrical analogy, by considering 
the geometry of isotropic three-dimensional space as the geometry on a hypersurface known 
to be isotropic, in a fictitious four-dimensional space.t Such a space is a hypersphere; the 
three-dimensional space corresponding to this has a positive constant curvature. The 
equation of a hypersphere of radius a in the four-dimensional space x,, X2, X3, x4, has the 
form 

x? +x +x +x =a’, 
and the element of length on it can be expressed as 
dl? = dx? +dx?4dx3+4dx?. 


Considering x,, X2, x4 as the three space coordinates, and eliminating the fictitious co- 
ordinate x, with the aid of the first equation, we get the element of spatial distance in the 
form 
(x, dx, x4 dx, x4 dx? 


d? = dx? +dx?+dx? + 
! ? 3 a?-xi-xi-xi 


(111.5) 


From this expression, it is easy to calculate the constant A in (111.2). Since we know 
beforehand that P,, has the form (111.3) over all space, it is sufficient to calculate it only for 
points located near the origin, where the y,, are equal to 


Xa Xg 
a? ` 


Yag = Sap + 


Since the first derivatives of the y,,, and consequently the quantities 4%., vanish at the 
origin, the calculation from the general formula (92.7) turns out to be very simple and gives 
the result 


1 
A=. (111.6) 


We may call the quantity a the ‘‘radius of curvature" of the space. We introduce in place 
of the coordinates x,, x2, X3, the corresponding "spherical" coordinates r, 0, 6. Then the 
line element takes the form 

dr? PAP. 2 2 
; t r^(sin^ 0 do^ -- d0^). (111.7) 
pH 
1-— 


a? 


dl = 


The coordinate origin can of course be chosen at any point in space. The circumference of a 
circle in these coordinates is equal to 27r, and the surface of a sphere to 4zr?. The “radius” 
of a circle (or sphere) is equal to 


: dr a 
[3 - 0 sin (r/a), 


that is, is larger than r. Thus the ratio of circumference to radius in this space is less than 27. 
Another convenient form for the d/? in “four-dimensional spherical coordinates” is 


T This four-space is understood to have nothing to do with four-dimensional space-time. 
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obtained by introducing in place of the coordinate r the “angle” y according to r =a sin y 
(y goes between the limits 0 to z).¢ Then 


dl? = a?[dy? - sin? y(sin? 0 d$? 4- 40?)]. (111.8) 
The coordinate y determines the distance from the origin, given by ay. The surface of a 
sphere in these coordinates equals 4ra? sin? y. We see that as we move away from the 
origin, the surface of a sphere increases, reaching its maximum value 47a? at a distance 
of za/2. After that it begins to decrease, reducing to a point at the "opposite pole" of the 
space, at distance za, the largest distance which can in general exist in such a space [all this 
is also clear from (111.7) if we note that the coordinate r cannot take on values greater 
than a]. 
The volume of a space with positive curvature is equal to 
2n a 


"i 


f a? sin? y sin 0 dy dO dd. 
0 


so that 
V = 27?a?. (111.9) 


Thus a space of positive curvature turns out to be “closed on itself”. Its volume is finite 
though of course it has no boundaries. 

It is interesting to note that in a closed space the total electric charge must be zero. 
Namely, every closed surface in a finite space encloses on each side of itself a finite region 
of space. Therefore the flux of the electric field through this surface is equal, on the one hand, 
to the total charge located in the interior of the surface, and on the other hand to the total 
charge outside of it, with opposite sign. Consequently, the sum of the charges on the two 
sides of the surface is zero. 

Similarly, from the expression (96.16) for the four-momentum in the form of a surface 
integral there follows the vanishing of the total four-momentum P' over all space. Thus the 
definition of the total four-momentum loses its meaning, since the corresponding con- 
servation law degenerates into the empty identity 0 = 0. 

We now go on to consider geometry of a space having a constant negative curvature. 
From (111.6) we see that the constant 4 is negative if a is imaginary. Therefore all the 
formulas for a space with negative curvature can be immediately obtained from the pre- 
ceding ones by replacing a by ia. In other words, the geometry of a space with negative 
curvature is obtained mathematically as the geometry on a four-dimensional pseudosphere 
with imaginary radius. 


Thus the constant 4 is now 


ae (111.10) 


- 


and the element of length in a space of negative curvature has, in coordinates r, 0, $, the 
form 
dr? 


dP = 2 +r?(sin? 0 d$? +d6"), QI 
1+ 3 
a 


T The “cartesian” coordinates xi, x2, xa, x4 are related to the four-dimensional spherical coordinates 
a, 0, à, x by the relations: 
xı =asin x sin 0 cos ¢, Xa — asin y sin @ sin ¢, 
xa =a sin y cos 6, X4 = a COS x. 
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where the coordinate r can go through all values from 0 to oo. The ratio of the circumference 
of a circle to its radius is now greater than 2z. The expression for d/? corresponding to 
(111.8) is obtained if we introduce the coordinate y according to r = a sinh x (x here goes 
from 0 to oo). Then 
dl? = a? (dy? t sinh? x(sin? 0 d$? + 40?)). (111.12) 
The surface of a sphere is now equal to 4na* sinh? y and as we move away from the 
origin (increasing x), it increases without limit. The volume of a space of negative curvature 
is, clearly, infinite. 


PROBLEM 


Transform the element of length (111.7).to a form in which it is proportional to its euclidean 
expression (conformal-euclidean coordinates). 
Solution: The substitution 


leads to the result: 


2\-2 
dl? = (1+ ia) (dr2-- r2 d0?+ r? sin? 8 - dé?). 


$ 112. The closed isotropic model 


Going on now to the study of the space-time metric of the isotropic model, we must 
first of all make a choice of our reference system. The most convenient is a *'co-moving" 
reference system, moving, at each point in space, along with the matter located at that 
point. In other words, the reference system is just the matter filling the space; the velocity 
of the matter in this system is by definition zero everywhere. It is clear that this reference 
system is reasonable for the isotropic model—for any other choice the direction of the 
velocity of the matter would lead to an apparent nonequivalence of different directions in 
space. The time coordinate must be chosen in the manner discussed in the preceding 
section, i.e. so that at each moment of time the metric is the same over all of the space. 

In view of the complete equivalence of all directions, the components goa of the metric 
tensor are equal to zero in the reference system we have chosen. Namely, the three com- 
ponents go, can be considered as the components of a three-dimensional vector which, if it 
were different from zero, would lead to a nonequivalence of different directions. Thus ds? 
must have the form ds? = goo (dx?)? — d?. The component goo is here a function only of x°. 
Therefore we can always choose the time coordinate so that goo reduces to 1. Denoting it 
by ct, we have 

ds? = c? df — dV. (112.1) 


This time 7 is the synchronous proper time at each point in space. 

Let us begin with the consideration of a space with positive curvature; from now on we 
shall, for brevity, refer to the corresponding solution of the Einstein equations of gravitation 
as the "closed model". For dl we use the expression (111.8) in which the "radius of curvature" 
a is, in general, a function of the time. Thus we write ds? in the form 


ds? = c? d? — a*(r) (dy? sin? x(d0? t sin? 0: d$?)). (112.2) 


§ 112 THE CLOSED ISOTROPIC MODEL 363 


The function a(t) is determined by the equations of the gravitational field. For the solu- 
tion of these equations it is convenient to use, in place of the time, the quantity y defined by 
the relation 


c dt =a dy. (112.3) 
Then ds? can be written as 


ds? = a?(n){dn? — dy? — sin? y(d0? + sin? 0- d$?)). (112.4) 


To set up the field equations we must begin with the calculation of the components of 
the tensor R; (the coordinates x°, x!, x?, x? are y. y, 0, à). Using the values of the com- 
ponents of the metric tensor, 


2 2 2 ein? 2 (iQ? tae 
Joo = 4, gı —4, 922 = —a sin” y, 933 = —a^ sin® y sin" 0, 
we calculate the quantities I1: 


, r , 


a a a 
0 uU a a 0 a 
Foo = a V = EP Gaps op = a p Tio = Foo = 0, 


where the prime denotes differentiation with respect to 7. (There is no need to compute the 
components I5, explicitly.) Using these values, we find from the general formula (92.7): 


3 
R$ = — (a? — aa"). 
a 


From the same symmetry arguments as we used earlier for the goa it is clear from the start 
that Ro, = 0. For the calculation of the components R^ we note that if we separate in them 
the terms containing only g,, (i.e. only the P5,), these terms must constitute the compo- 
nents of a three-dimensional tensor — Pj, whose values are already known from (111.3) 
and (111.6): 


2 
RPhíBÉ-PÉ...-——-M.., 
a 
where the dots represent terms containing goo in addition to the g,g. From the computation 
of these latter terms we find: 
1 
R&= — E (2a? +a’? + aa^)é^, 
so that 
0 a 6 " 
R = Ro+ Ri = — = (a +a”). 
a 
Since the matter is at rest in the frame of reference we are using, u^ = 0, u° = l/a, and 


we have from (94.9) T6 = e, where e is the energy density of the matter. Substituting these 
expressions in the equation 


1 82k 
Ro- 2 R =-7 To, 
we obtain: 
8nk 3 
A = 5 (a? +a). (112.5) 


Here there enter two unknown functions e and a; therefore we must obtain still another 
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equation. For this is convenient to choose (in place of the spatial components of the field 
equations) the equation T>,; = 0, which is one of the four equations (94.7) contained, as 
we know, in the equations of gravitation. This equation can also be derived directly with 
the help of thermodynamic relations, in the following fashion. 

When in the field equations we use the expression (94.9) for the energy-momentum 
tensor, we are neglecting all those processes which involve energy dissipation and lead to an 
increase in entropy. This neglect is here completely justified, since the auxiliary terms which 
should be added to T* in connection with such processes of energy dissipation are negligibly 
small compared with the energy density e, which contains the rest energy of the material 
bodies. 

Thus in deriving the field equations we may consider the total entropy as constant. We 
now use the well-known thermodynamic relation d? = T d$—p dV, where &, S, V, are 
the energy, entropy, and volume of the system, and p, T, its pressure and temperature. At 
constant entropy, we have simply d? = — p dV. Introducing the energy density c = &/V 
we easily find 


dV 
de = —(e+p) Y 


The volume V of the space is, according to (111.9). proportional to the cube of the radius of 
curvature a. Therefore dV/V = 3da/a =3d(In a), and we can write 


de 
— —— =3d(l a 
e+p doa 


or, integrating, 


d 
3Ina = — | -fÈ +const (112.6) 
pte 


(the lower limit in the integral is constant). 

If the relation between £ and p (the “equation of state" of the matter) is known, then 
equation (112.6) determines € as a function of a. Then from (112.5) we can determine n 
in the form 


y= +| — (112.7) 
NINE yc 
4.4 £a 
Equations (112.6-7) solve, in general form, the problem of determining the metric in the 
closed isotropic model. 

If the material is distributed in space in the form of discrete macroscopic bodies, then to 
calculate the gravitational field produced by it, we may treat these bodies as material particles 
having definite masses, and take no account at all of their internal structure. Considering 
the velocities of the bodies as relatively small (compared with c), we can set € = uc?, where 
u is the sum of the masses of the bodies contained in unit volume. For the same reason 
the pressure of the “gas” made up of these bodies is extremely small compared with e, 
and can be neglected (from what we have said, the pressure in the interior of the bodies has 
nothing to do with the question under consideration). As for the radiation present in space, 
its amount is relatively small, and its energy and pressure can also be neglected. 
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Thus, to describe the present state of the Universe in terms of this model, we should use 
the equation of state for *dustlike" matter, 


e=puc?, p=0. 


The integration in (112.6) then gives ua? = const. This equation could have been written 
immediately, since it merely expresses the constancy of the sum M of the masses of the bodies 
in all of space, which should be so for the case of dustlike matter.f Since the volume of space 
in the closed model is V = 2z?a?, const = M/2n?. Thus 


M 
a? = const = —;. 112.8 
m const = 77 ( ) 
Substituting (112.8) in equation (112.7) and performing the integration, we get: 
a = ag(1— cos m, (112.9) 
where the constant 
a |. 2kM 
° Bre? 


Finally, for the relation between t and y we find from (112.3): 
t = 29 (n — sin n). (112.10) 
c 


The equations (112.9-10) determine the function a(t) in parametric form. The function 

a(t) grows from zero at 1 = 0 (y = 0) to a maximum value of a = 2a,, which is reached 

when t = zaj/c (n = x), and then decreases once more to zero when ! = 2zao/c (n = 2x). 
For 5 < 1 we have approximately a = ao 1?/2, t = ag n? [6c, so that | 


9a, c2\ 1/3 
T 12/3, (112.11) 
The matter density is 
1 8 x 10? 


(where the numerical valuc is given for density in gm: cm"? and t in sec). We call attention 
to the fact that in this limit the function j(t) has a universal character in the sense that it 
does not depend on the parameter aj. 

When a — 0 the density u goes to infinity. But as x > oo the pressure also becomes large, 
so that in investigating the metric in this region we must consider the opposite case of 
maximum possible pressure (for a given energy density e), i.e. we must describe the matter 
by the equation of state 


= 
P=3 
(see the footnote on p. 87). From formula (112.6) we then get: 
3c*a? 
s a* = const = : 112.13 
£a cons m ( ) 


t To avoid misunderstandings (that might arise if one considers the remark in § 111 that the total four- 
momentum of a closed universe is zero), we emphasize that M is the sum of the masses of the bodies taken 
one by one, without taking account of their gravitational interaction. 
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(where a, is a new constant), after which equations (112.7) and (112.3) give the relations 
a — d, Sin fj, t = = (1—cos n). 


Since it makes sense to consider this solution only for very large values of e (i.e. small a), 
we assume y <1. Then a x a, n, t x a,n?/2c, so that 

a — 2a, ct. (112.14) 
Then 


£ 3 4.5 x 105 


à 3xmu) P (112.15) 


(which again contains no parameters). 

Thus, here too a — 0 for t — 0, so that the value t = 0 is actually a singular point of the 
space-time metric of the isotropic model (and the same remark applies in the closed model 
also to the second point at which a = 0). We also see from (112.14) that if the sign of t is 
changed, the quantity a(t) would become imaginary, and its square negative. All four 
components g, in (112.2) would then be positive, and the determinant g would be positive. 
But such a metric is physically meaningless. This means that it makes no sense physically 
to continue the metric analytically beyond the singularity. 


$ 113. The open isotropic model 


The solution corresponding to an isotropic space of negative curvature (“open model”) 


is obtained by a method completely analogous to the preceding. In place of (112.2), we now 
have 


ds? = c? dt? — a?(r)(dy? + sinh? y(d6? + sin? 0 d$?)]. (113.1) 
Again we introduce in place of ¢ the variable 7, according to c dt = a dy; then we get 
ds? = a?(n){dn? — dy? —sinh? y(d0? + sin? 8: d$?)1. (113.2) 


This expression can be obtained formally from (112.4) by changing n, y, a respectively to 
in, iy, ia. Therefore the equations of the field can also be gotten directly by this same sub- 
situation in equations (112.5) and (112.6). Equation (112.6) retains its previous form: 


sina =~ f -$= +const, (113.3) 
e+p 
while in place of (112.5), we have 

8nk 3 

P e — 75 (a^ — a^. (113.4) 


Corresponding to this we find, instead of (1 2n 


n= = f JEn 


(113.5) 
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For material in the form of dust, we find: 


a = a,(cosh n—1), t=% (sinh y—n), (113.6) 
a= e 113.7 
H = gnk € (113.7) 


The formulas (113.6) determine the function a(t) in parametric form. In contrast to the 
closed model, here the radius of curvature changes monotonically, increasing from zero 
at’ = 0 (4 = 0) to infinity for 100 (g— oc ). Correspondingly, the matter density decreases 
monotonically from an infinite value when + = 0 (when y< 1l, the monotonic decrease 
is given by the same approximate formula (112.12) as in the closed model). 

For large densities the solution (113.6-7) is not applicable, and we must again go to the 
case p = e/3. We again get the relation 


ta^ = const = - Mug (113.8) 
8nk 
and find for the function a(t): 
a=a, sinhn, t =“! (cosh y- 1) 
or, when 5 < 1, ° 
a = 2a, ct (113.9) 


[with the earlier formula (112.15) for &(1)]. Thus in the open model, also, the metric has a 
singularity (but only one, in contrast to the closed model). 

Finally, in the limiting case of the solutions under consideration, corresponding to an 
infinite radius of curvature of the space, we have a model with a flat (euclidean) space. The 
interval ds? in the corresponding space-time can be written in the form 


ds? = c? dt? —b?(t(dx? +. dy? + dz?) (113.10) 


(for the space coordinates we have chosen the "'cartesian" coordinates x, y, z). The time- 
dependent factor in the element of spatial distance does not change the euclidean nature of 
the space metric, since for a given t this factor is a constant, and can be made unity by a simple 
coordinate transformation. A calculation similar to those in the previous paragraph leads to 


t We note that, by the transformation 
r — Ae" sinh y, ct = Ae" cosh y, 


= r 
Ae? = V chr, tanh y = —, 
ct 


the expression (113.2) is reduced to the "conformal-galilean" form 
ds? = f(r, Dic? dc? — dr? —r?(d0? | sin? 0 d?)]. 
Specifically, in the case of (113.6), setting A equal to ao/2, 


4 
ds? = ( - a ) [c? dr? — dr? — r?(d0? + sin? 0 d?) 


(V. A. Fock, 1955). For large values of V c?:?—r? (which correspond to 7 3» 1), this metric tends toward a 
galilean form, as was to be expected since the radius of curvature tends toward infinity. 

In the coordinates r, 0. ¢, 7, the matter is not at rest and its distribution is not uniform; the distribution 
and motion of the matter turns out to be centrally symmetric about any point of space chosen as the origin 
of coordinates r, 0, ¢. 
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the following equations: 


8xk 3 /db\? de 
7»): 3inb = = f +const 
For the case of low pressures, we find 
ub? = const, b = const £”. (113.11) 


For small t we must again consider the case p = £/3, for which we find 


eb* = const, b = const Vt. (113.12) 


Thus in this case also the metric has a singular point (t = 0). 

We note that all the isotropic solutions found exist only when the matter density is 
different from zero; for empty space the Einstein equations have no such solutions. We 
also mention that mathematically they are a special case of a more general class of solutions 
that contain three physically different arbitrary functions of the space coordinates (see the 
problem). 


PROBLEM 
Find the general form near the singular point for the metric in which the expansion of the space 
proceeds ‘‘quasihomogeneously”, i.e. so that all components yas = —£., (in the synchronous 


reference system) tend to zero according to the same law. The space is filled with matter with the 
equation of state p = e/3 (E. M. Lifshitz and I. M. Khalatnikov, 1960). 
Solution: We look for a solution near the singularity (t = 0) in the form: 
Yas = lQastt*bast ..., a) 
where aas and bas are functions of the (space) coordinates] ; below, we shall set c = 1. The reciprocal 
tensor is 


y^ — : a7? — b°’, 


where the tensor a°’ is reciprocal to a«s, while b"? = a*’a**b,,; all raising and lowering of indices 
and covariant differentiation is done using the time-independent metric aas. 
Calculating the left sides of equations (97.11) and (97.12) to the necessary order in 1/1, we get 


3 1 82k à 
= ag 3? =z e(—4up+1), 
1 
el ea 
2 
(where b = 62). Also using the identity 


Elia Uy 


] 
lau  u— z Hausa“, 


we find: 


3 b Ü 
mL. = —— i. 7 5 r 2 
8nke i r uy (b, — bi) Q) 


+ For e=0 we would get from (113.5) a = aoe” = ct [whereas the equations (112.7) are meaningless 

because the roots are imaginary]. But the metric 
ds? = c? dt? — c? t? (dx? - sinh? x(d0?-F- sin? 0 d$?) ) 
can be transformed by the substitution r = ct sinh x, t = f cosh x, to the form 
ds? = c*dt? — dr? —r?(d0? -- sin? 0 d$?), 

i.e. to a galilean space-time. 

1 The Friedmann solution corresponds to a special choice of the functions aag, corresponding to a space 
of constant curvature. 
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The three-dimensional Christoffel symbols, and with them the tensor Pas, are independent of the 
time in the first approximation in 1/r; the Pas coincide with the expressions obtained when calculating 
simply with the metric a.s. Using this, we find that in equation (97.13) the terms of order 1^? 
cancel, while the terms ~ 1/7 give 


3 5 
EL Pb = E 
. Pot gbat 73 506 0, 
from which 
4 5 
a. pi. = sip 
b; 3 P; 18 Hs (3) 
(where P = a^'P,,). In view of the identity 
] 
Pia 5 Pre =0 
[see (92.10)] the relation 
"ED 
aif ~~ 9 ie 
is valid, so that the «a Can be written in the form: 
t? 
= ——b.,. 
U, 9 Os (4) 


Thus, all six functions a,5 remain arbitrary, while the coefficients bas of the next term in the 
expansion (1) are determined in terms of them. The choice of the time in the metric (1) is completely 
determined by the condition f — 0 at the singularity; the space coordinates still permit arbitrary 
transformations that do not involve the time (which can be used, for example, to bring aag to 
diagona! form). Thus the solution contains all together three “physically different” arbitrary 
functions. 

We note that in this solution the spatial metric is inhomogeneous and anisotropic, while the 
density of the matter tends to become homogeneous as r—>0. In the approximation (4) the three- 
dimensional velocity v has zero curl, while its magnitude tends to zero according to the law 


v? = Va Vg yt? e £3, 


§ 114. The red shift 


The main feature characteristic of the solutions we have considered is the nonstationary 
metric; the radius of curvature of the space is a function of the time. A change in the radius 
of curvature leads to a change in all distances between bodies in the space, as is already seen 
from the fact that the element of spatial distance d/ is proportional to a. Thus as a increases 
the bodies in such a space "run away” from one another (in the open model, increasing a 
corresponds to 540, and in the closed model, to 0«5« z). 

From the point of view of an observer located on one of the bodies, it will appear as if 
all the other bodies were moving in radial directions away from the observer. The speed of 
this "running away" at a given time t is proportional to the separation of the bodies. 

This prediction of the theory must be compared with a fundamental astronomical fact— 
the red shift of lines in the spectra of galaxies. If we regard this as a Doppler shift, we arrive 
at the conclusion that the galaxies are receding, i.e. at the present time the Universe is 
expanding.T 


t The conclusion that the bodies are running away with increasing a(t) can only be made if the energy of 
interaction of the matter is small compared to the kinetic energy of its motion in the recession; this condition 
is always satisfied for sufficiently distant galaxies. In the opposite case the mutual separations of the bodies 
is determined mainly by their interactions; therefore, for example, the effect considered here should have 
practically no influence on the dimensions of the nebulae themselves, and even less so on the dimensions of 
stars. 
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Let us consider the propagation of a light ray in an isotropic space. For this purpose it is 
simplest to use the fact that along the world line of the propagation of a light signal the 
interval ds = 0. We choose the point from which the light emerges as the origin of co- 
ordinates y, 0, 6. From symmetry considerations it is clear that the light ray will propagate 
“radially”, i.e. along a line 0 = const, $ = const. In accordance with this, we set 0 = dd = 0 
in (112.4) or (113.2) and obtain ds? = a*(dn?—dy’). Setting this equal to zero. we find 
dy = +dy or, integrating, 

y = +n+const. (114.1) 


The plus sign applies to a ray going out from the coordinate origin, and the minus sign to a 
ray approaching the origin. In this form, equation (114.1) applies to the open as well as to 
the closed model. With the help of the formulas of the preceding section, we can from this 
express the distance traversed by the beam as a function of the time. 

In the open model, a ray of light, starting from some point, in the course of its propaga- 
tion recedes farther and farther from it. In the closed model, a ray of light, starting out from 
the initial point, can finally arrive at the conjugate pole” of the space (this corresponds to a 
change in y from 0 to 2); during the subsequent propagation, the ray begins to approach the 
initial point. A circuit of the ray "around the space", and return to the initial point, would 
correspond to a change of y from 0 to 27. From (114.1) we see that then n would also have 
to change by 27, which is, however, impossible (except for the one case when the light starts 
at a moment corresponding to n = 0). Thus a ray of light cannot return to the starting point 
after a circuit "around the space". 

To a ray of light approaching the point of observation (the origin of coordinates), there 
corresponds the negative sign on y in equation (114.1). If the moment of arrival of the ray 
at this point is (79), then for 7 = no we must have y = 0, so that the equation of propagation 
of such rays is 

X—o—T- (114.2) 


From this it is clear that for an observer located at the point y = 0, only those rays of light 
can reach him at the time t(y9), which started from points located at "distances" not 
exceeding y = Qo. 

This result, which applies to the open as well as to the closed model, is very essential. 
We see that at each given moment of time t(n), at a given point in space, there is accessible 
to physical observation not all of space, but only that part of it which corresponds to 
y <n. Mathematically speaking, the ‘‘visible region" of the space is the section of the four- 
dimensional space by the light cone. This section turns out to be finite for the open as well 
as the closed model (the quantity which is infinite for the open model is its section by the 
hypersurface f = const, corresponding to the space where all points are observed at one and 
the same time f). In this sense, the difference between the open and closed models turns out 
to be much less drastic than one might have thought at first glance. 

The farther the region observed by the observer at a given moment of time recedes from 
him, the earlier the moment of time to which it corresponds. Let us look at the spherical 
surface which is the geometrical locus of the points from which light started out at the time 
t(r — y) and is observed at the origin at the time f(y). The area of this surface is 4za?(g — y) 
sin? y (in the closed model), or 4na?(n — y) sinh? y (in the open model). As it recedes from 
the observer, the area of the “visible sphere" at first increases from zero (for y = 0) and then 
reaches a maximum, after which it decreases once more, dropping back to zero for y — 5 
(where a(n — y) = a(0) = 0). This means that the section through the light cone is not only 
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finite but also closed. It is as if it closed at the point “conjugate” to the observer; it can be 
seen by observing along any direction in space. At this point £ — oo, so that matter in all 
stages of its evolution is, in principle, accessible to observation. 

The total amount of observed matter is equal in the open model to 


" 
EET: f pa? sinh? y: dy. 
0 


Substituting ua? from (113.7), we get 


2 


3c*ag ,. 
Mo; = E (sinh y cosh n — n). (114.3) 


This quantity increases without limit as y > oo. In the closed model, the increase of M,,, 
is limited by the total mass M; in similar fashion, we find for this case: 


M . 
Moy, = — (1 sin n cos n). (114.4) 


As y increases from 0 to z, this quantity increases from 0 to M; the further increase of M,,, 
according to this formula is fictitious, and corresponds simply to the fact that in a ‘‘con- 
tracting" universe distant bodies would be observed twice (by means of the light "circling 
the space" in the two directions). 

Let us now consider the change in the frequency of light during its propagation in an 
isotropic space. For this we first point out the following fact. Let there occur at a certain 
point in space two events, separated by a time interval dt = (1/c)a (n) dy. If at the moments 
of these events light signals are sent out, which are observed at another point in space, then 
between the moments of their observation there elapses a time interval corresponding to the 
same change dy in the quantity 7 as for the starting point. This follows immediately from 
equation (114.1), according to which the change in the quantity y during the time of propaga- 
tion of a light ray from one point to another depends only on the difference in the co- 
ordinates x for these points. But since during the time of propagation the radius of curvature 
a changes, the time interval t between the moments of sending out of the two signals and the 
moments of their observation are different; the ratio of these intervals is equal to the ratio of 
the corresponding values of a. 

From this it follows, in particular, that the periods of light vibrations, measured in terms 
of the world time £, also change along the ray, proportionally to a. Thus, during the propaga- 
tion of a light ray, along its path, 


wa = const. (114.5) 
Let us suppose that at the time t(n) we observe light emitted by a source located at a 
distance corresponding to a definite value of the coordinate y. According to (114.1), the 
moment of emission of this light is t(7—). If wo is the frequency of the light at the time of 
emission, then from (114.5), the frequency œw observed by us is 

=o a(n— x) 

= Wo . 

a(n) 

Because of the monotonic increase of the function a(n), we have o < wo, that is, a decrease 
in the light frequency occurs. This means that when we observe the spectrum of light coming 


(114.6) 


372 RELATIVISTIC COSMOLOGY § 114 


toward us, all of its lines must appear to be shifted toward the red compared with the spec- 
trum of the same matter observed under ordinary conditions. The “red shift" phenomenon 
is essentially the Doppler effect of the galaxies’ "running away” from each other. 

The magnitude of the red shift measured, for example, as the ratio c/c of the displaced 
to the undisplaced frequency, depends (for a given time of observation) on the distance at 
which the observed source is located [in relation (114.6) there enters the coordinate y of 
the light source]. For not too large distances, we can expand a(n — y) in a power series in y, 
limiting ourselves to the first two terms: 

o y 20 


wo d a(n) 
(the prime denotes differentiation with respect to 4). Further, we note that the product 
ya(n) is here just the distance / from the observed source. Namely, the “radial” line element 
is equal to dl = a dy; in integrating this relation the question arises of how the distance is to 
be determined by physical observation. In determining this distance we must take the values 
of a at different points along the path of integration at different moments of time (integration 
for 7 = const would correspond to simultaneous observation of all the points along the 
path, which is physically not feasible). But for “small’’ distances we can neglect the change 
in a along the path of integration and write simply / = ay, with the value of a taken for the 
moment of observation. 

As a result, we find for the percentage change z in the frequency the following formula: 

w— Wo H 


FM oy (114.7) 
Wo c 


where we have introduced the notation 
(114.8) 


for the so-called “Hubble constant". For a given instant of observation, this quantity is 
independent of /. Thus the relative shift in spectral lines must be proportional to the distance 
to the observed light source. 

Considering the red shift as a result of a Doppler effect, one can determine the corres- 
ponding velocity v of recession of the galaxy from the observer. Writing z = v/c, and 
comparing with (114.7), we have 

v — HI (114.9) 
(this formula can also be obtained directly by calculating the derivative v = d(ay)/dr. 
Astronomical data confirm the law (114.7), but the determination of the value of the 


Hubble constant is hampered by the uncertainty in the establishment of a scale of cosmic 
distances suitable for distant galaxies. The latest determinations give the value 


H & 0.8x107 !9yr^! = 0.25 x 1077 sec^!, (114.10) 
1/H z:4 x 10!7 sec = 1.3x 10!9 yr, 
It corresponds to an increase in the “velocity of recession" by 75 km/sec for each mega- 
parsec distance.t 


t There also exist estimates leading to a smaller value of H, corresponding to an increase of the velocity 
of recession by 55 km/sec in each megaparsec; then 1/H~18 x 10? yr. 
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Substituting in equation (113.4), £ = uc* and H = ca’/a’, we get for the open model the 
following relation: 


T sH y (114.11) 


Combining this equation with the equality 


csinh y c "n 
"ae(coshg—1) a coth 2’ 
we obtain 
= 
n 3 
h: = H |—-. 
cosh ; V Sak (114.12) 
For the closed model we would get: 
c? Brk 2 
a= a-H.. (114.13) 
n 3 
~=H . 
cos 5 Jz (114.14) 


Comparing (114.11) and (114.13), we see that the curvature of the space is negative or 
positive according as the difference (8xk/3)u—h? is negative or positive. This difference 
goes to zero for u = u,, where 

3H? 
Me = up (114.15) 
With the value (114.10), we get 4, = 1 x 1079 g/cm’. In the present state of astronomical 
knowledge, the value of the average density of matter in space can be estimated only with 
very low accuracy. For an estimate, based on the number of galaxies and their average mass, 
one now takes a value of about 3 x 1073! g/cm?. This value is 30 times less than 4, and thus 
would speak in favor of the open model. But even if we forget about the doubtful reliability 
of this number, we should keep in mind that it does not take into account the possible 
existence of a metagalactic dark gas, which could greatly increase the average matter 
density. 

Let us note here a certain inequality which one can obtain for a given value of the quantity 

c sinh y 


. H = osh p 
H. For the open model we have aycosh n— 1? 


and therefore 
ag. sinh n(sinh n — n) 
t= ~ hy-n) = ————_—.. 
c (sinh n) H(cosh n—1)? 
Since 0 < y < oo, we must have 


inc «qe (114.16) 


Similarly, for the closed model we obtain 


_ sin n(n—sin n) 
^ H(1—cos n)? 
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To the increase of a(n) there corresponds the interval 0 < n < n; therefore we get 


2 

Osten (114.17) 
Next we determine the intensity 7 of the light arriving at the observer from a source 
located at a distance correspondingto a definite value of the coordinate y. The flux density of 
light energy at the point of observation is inversely proportional to the surface of the sphere, 
drawn through the point under consideration with center at the location of the source; in a 
space of negative curvature the area of the surface of the sphere equals 4na? sinh? y. 
Light emitted by the source during the interval dt = (1/c)a(yn—y)dn will reach the point 

of observation during a time interval 


a(n) 
a(n— x) 


1 
= - a(n) dn. 
c 


Since the intensity is defined as the flux of light energy per unit time, there appears in 7 a 
factor a(n—y)/a(y). Finally, the energy of a wave packet is proportional to its frequency 
[see (53.9)]; since the frequency changes during propagation of the light according to the 
law (114.5), this results in the factor a(n —y)/a(n) appearing in J once more. As a result, we 
finally obtain the intensity in the form 


a'(n- x) 
I= t————. 14.18 
Cons" Gp) sinh? y (114.18) 
For the closed model we would similarly obtain 
a*(n—x) 
I= t ;--. . 
cons a (p sin? y (114.19) 


These formulas determine the dependence of the apparent brightness of an observed object 
on its distance (for a given absolute brightness). For small y we can set a(n — y) = a(n), and 
then J ~ 1/a?(n)y? = 1/1?, that is, we have the usual law of decrease of intensity inversely 
as the square of the distance. 

Finally, let us consider the question of the so-called proper motions of bodies. In speaking 
of the density and motion of matter, we have always understood this to be the average 
density and average motion; in particular, in the system of reference which we have always 
used, the velocity of the average motion is zero. The actual velocities of the bodies will under- 
go a certain fluctuation around this average value. In the course of time, the velocities of 
proper motion of the bodies change. To determine the law of this change, let us consider a 
freely moving body and choose the origin of coordinates at any point along its trajectory. 
Then the trajectory will be a radial line, 0 = const, $ = const. The Hamilton-Jacobi 
equation (87.6), after substitution of the values of g'*, takes the form 


SV /aS\? 
(2) — (2) 4 m?c?a?(g) = 0. (114.20) 
OX on 


Since y does not enter into the coefficients in this equation (i.e., y is a cyclic coordinate), the 
conservation law @S/éy = const is valid. The momentum p of a moving body is equal, by 
definition, to p = 0S/0! = S/a éy. Thus for a moving body the product pa is constant: 


pa — const. (114.21) 
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Introducing the velocity v of proper motion of the body according to 


we obtain 


-= m const. (114.22) 


The law of change of velocity with time is determined by these relations. With increasing a, 
the velocity v decreases monotonically. 


PROBLEMS 


1. Find the first two terms in the expansion of the apparent brightness of a galaxy as a function 
of its red shift; the absolute brightness of a galaxy varies with time according to an exponential 
law, Jabs = const : e*' (H. Robertson, 1955). 

Solution: The dependence on distance y of the apparent brightness of a galaxy at the "instant" 
5, is given (for the closed model) by the formula 

a?(n — x) 
a*(n) sin? x 
We define the red shift as the relative change in wave length: 


I = const: etoos t(n)) 


| À-Àe _ —o  a(m-—a(-2) 
do e a(n —x) 


Expanding 7 and z in powers of x [using the functions a(n) and t() from (112.9) and (112.10)] and 
then eliminating x from the resulting equations, we find the result: 


l 
I =const =] 1- ( -14 a): 


where we have introduced the notation 


2 H 
-———— =>>]. 
q 1l+cosy Hk 
For the open model, we get the same formula with 
2 H 
q =- <l. 


= l+coshy uk 


2. Find the leading terms in the expansion of the number of galaxies contained inside a "sphere" 
of given radius, as a function of the red shift at the boundary of the sphere (where the spatial 
distribution of galaxies is assumed to be uniform). . 

Solution: The number N of galaxies at "distances" « x is (in the closed model) 


x 


N = const f sin? x dx = const: x?. 
o 
Substituting the first two terms in the expansion of the function x(z), we obtain: 
3 
N — const : z? | -3 atoz]. 


In this form the formula also holds for the open model. 
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§ 115. Gravitational stability of an isotropic universe 


Let us consider the question of the behavior of small perturbations in the isotropic model, 
i.e. the question of its gravitational stability (E. M. Lifshitz, 1946). We shall restrict our 
treatment to perturbations over relatively small regions of space—regions whose linear 
dimensions are small compared to the radius a.f 

In every such region the spatial metric can be assumed to be euclidean in the first approxi- 
mation, i.e. the metric (111.8) or (111.12) is replaced by the metric 


dl? = a*(n\(dx*+dy?+dz’), (115.1) 


where x, y, z are cartesian coordinates, measured in units of the radius a. We again use the 
parameter 7 as time coordinate. 

Without loss of generality we shall again describe the perturbed field in the synchronous 
reference system, i.e. we impose on the variations óg;, of the metric tensor the conditions 
Joo = 0go, = 0. Varying the identity g;, u‘u* = | under these conditions (and remembering 
that the unperturbed values of the components of the four-velocity of the matter are 
u? = Ifa, u* =0),f we get goo u°du° = 0,so that ĉu? = 0. The perturbations ĝu” are in general 
different from zero, so that the reference system is no longer co-moving. 


We denote the perturbations of the spatial metric tensor by hag = óy,, = —5g,. Then 
ôy? = —h**, where the raising of indices on /,, is done by using the unperturbed metric 
Yap: 


In the linear approximation, the small perturbations of the gravitational field satisfy the 


equations 


. 8nk 
bRi—46,5R = "XI (115.2) 


In the synchronous reference system the variations of the components of the energy- 
momentum tensor (94.9) are: 
ôT? = —ófàp, STZ =a(ptejiu", T$ = óc. (115.3) 


Because of the smallness of de and dp, we can write óp = (dp/de)de, and we obtain the 
relations: 


d 
ST! = —58 -P sms. (115.4) 
de 


Formulas for óR can be gotten by varying the expression (97.10). Since the unperturbed 


metric tensor Yag = 475,,, the unperturbed values are 
2d 2a’ a’ 
= = = B 
Xap = a Yap = a? Yap» xf = a? Ôg» 


where the dot denotes differentiation with respect to cf, and the prime with respect to n. The 
perturbations of x,, and xf = Kay 7 are: 


1 l 
x, = hag = , hig, OK = — hy, + yh, = hb = P hë 
where h? = y*’h,,. For the euclidean metric (115.1) the unperturbed values of the three- 


+ A more detailed presentation of this question, including the investigation of perturbations over regions 
whose size is comparable to a, is given in Adv. in Physics 12, 208 (1963). 
1 In this section we denote unperturbed values of quantities by letters without the auxiliary superscript '?, 
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dimensional P? are zero. The variations ôP? are calculated from formulas (108.3-4): it is 
obvious that oP? is expressed in terms of the dy,, just as the four-tensor 5R,, is expressed 
in terms of the óg;,, all tensor operations being done in the three-dimensional space with the 
metric (115.1); because this metric is euclidean, all the covariant differentiations reduce 
to simple differentiations with respect to the coordinates x* (for the contravariant deriva- 
tives we must still divide by a”). Taking all this into account (and changing from derivatives 
with respect to ¢ to derivatives with respect to 7), we ic after some pL calculations: 


ôR! = - — EIC: Phbi-hbi-kh5--— LM x T he Lh 
1 , 
ôR = — 55 es 
ÖRS = jit, (115.5) 


(h= hj). Here both the upper and lower indices following the comma denote simple 
differentiations with respect to the x^ (we continue to write indices above and below only to 
retain uniformity of the notation). 

We obtain the final equations for the perturbations A^ by substituting in (115.4) the 
components Tf, expressed in terms of the ôR* according to (115.2). For these equations 
it is convenient to choose the equations obtained from (115.4) for «# f. and those obtained 
by contracting on x, f. They are: 


(hi5 ho 2— hee — hb?) 4 n" + E MP =0, a x B, 


Potes dp dp 
- (hè — "rh 243 — 0. 115.6 
071 <p (1432) +h" eho (24 =) = (115.6) 

The perturbations of the density and matter velocity can be determined from the known 
h? using formulas (115.2-3). Thus we have for the relative change of the density: 


ôe c* 1 c^ 
— = —— ( ôR?— -ôR -h! 2r 115. 
€ exl 92 )- 16xkea? 3 (nt a ) dum 


Among the solutions of equations (115.6) there are some that can be eliminated by a simple: 
transformation of the reference system (without destroying the condition of synchronism), 
and so do not represent a real physical change of the metric. The form of such solutions can 
be established by using formulas (1) and (2) in problem 3 of 8 97. Substituting the unper- 
turbed values y,, = a76,,, we get from them the following expressions for fictitious perturba- 
tions of the metric: 


d + 
i - ft [Te fot AD, (115.8) 


where the fo, f, are arbitrary (small) functions of the coordinates x, y, z 

Since the metric in the small regions of space we are considering is assumed to be euclidean, 
an arbitrary perturbation in such a region can be expanded in plane waves. Using x, y, z for 
cartesian coordinates measured in units of a, we can write the periodic space factor for the 
plane waves in the form e", where n is a dimensionless vector, which represents the wave 
vector measured in units of l/a (the wave vector is k = n/a). If we have a perturbation over 
a portion of space of dimensions ~ /, the expansion will involve waves of length 
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A = 2na[n ~ l. If we restrict the perturbations to regions of size / « a, we automatically 
assume the number n to be quite large (n > 2n). 

Gravitational perturbations can be divided into three types. This classification reduces 
to a determination of the possible types of plane waves in terms of which the symmetric 
tensor A,, can be represented. We thus obtain the following classification: 

1. Using the scalar function 

Q =e", (115.9) 
we can form the vector P = nQ and the tensors} 


B 
Q-iMQ, P TIE of S) (115.10) 


These plane waves correspond to perturbations in which, in addition to the gravitational 
field, there are changes in the velocity and density of the matter, i.e. we are dealing with per- 
turbations accompanied by condensations or rarefactions of the matter. The perturbation 
of hf is expressed in terms of the tensors Q^ and P^, the perturbation of the velocity is 
expressed in terms of the vector P, and the perturbation of the density, in terms of the 
scalar Q. 
2. Using the transverse vector wave 
S-se"", s-n=0, (115.11) 
we can form the tensor (nS, -- n, S?); the scalar corresponding to this does not exist, since 
n:S — 0. These waves correspond to perturbations in which, in addition to the gravita- 
tional field, we have a change in velocity but no change of the density of the matter; they 
may be called rotational perturbations. 
3. The transverse tensor wave 
Gi-gíe"" gin, =0. (115.12) 
We can construct neither a vector nor a scalar by using it. These waves correspond to 
perturbations of the gravitational field in which the matter remains at rest and uniformly 
distributed throughout space. In other words, these are gravitational waves in an isotropic 
universe. 
The perturbations of the first type are of principal interest. We set 


hE = AmPi-aGDnOL h= pQ. (115.13) 
From (115.7) we find for the relative change of the density 
ôe ct 2 3a' , 
T" oe wl 0. (115.14) 


The equations for determining A and p are gotten by substituting (115.13) in (115.6): 
2a' n? 
A+ —N- SA =0 
uu 3 0*0 


dp n? dp 
"r — — (å 1+3— }=0. . 
p 4 v (243 2)*5 (A+ »( + 3) (115.15) 


These equations have the following two partial integrals, corresponding to those fictitious 


+ We write upper and lower indices on the components of ordinary cartesian tensors only to preserve 
uniformity of notation. 
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changes of metric that can be eliminated by transforming the reference system: 


A= — u = const, (115.16) 


d dy 3a 
is -» (7. pan? | (115.17) 
a a a 


[the first of these is gotten from (115.8) by choosing fo = 0, f, = P,; the second by choosing 
fo 7 Q, f, = 0]. 

In the early stages of expansion of the universe, when the matter is described by the 
equation of state p = e/3, we have a = a,n, n <1 (in both the open and closed models). 
Equations (115.15) take the form: 


2 2 3, 2n? 
iy aS tw =0, et at tw) = 0. (115.18) 


These equations are conveniently investigated separately for the two limiting cases depend- 
ing on the ratio of the large quantities n and 1/n. 

Let us assume first that n is not too large (or that 7 is sufficiently small), so that ny < 1. 
To the order of accuracy for which the equations (115.18) are valid, we find from them for 


this case: 
3C n? 2n? n? 
i= S40, (14 59°), w= Leser Fa), 
"n 9 3 6 
where C,, C, are constants; solutions of the form (115.16) and (115.17) are excluded (in the 
present case these are the solution with 4 = — = const and the one with A + u ~ 1/y?). 


Calculating de/e from (115.14) and (112.15), we get the following expressions for the per- 
turbations of the metric and the density: 


-.36 


h PE + CQ PD, 


2 
36 LPL (Cina Con)0 for p=-, TH (115.19) 
e 9 3 n 

The constants C, and C, must satisfy conditions expressing the smallness of the perturba- 
tion at the time mo of its start: we must have hf < 1 (so that 4 < l and u < 1) and de/e < 1. 
As applied to (115.19) these conditions give the inequalities C, < no, C; < 1. 

In (115.19) there are various terms that increase in the expanding universe like different 
powers of the radius a = a, n. But this growth does not cause the perturbation to become 
large: if we apply formula (115.19) for an order of magnitude to 4 ~ 1/n, we see that 
(because of the inequalities found above for C, and C;) the perturbations remain small 
even at the upper limit of application of these formulas. 

Now suppose that n is so large that ny > 1. Solving (115.18) for this condition, we find 
that the leading terms in À and y are:t 


-Hü 


À = = const: i gin? 
T 


We then find for the perturbations of the metric and the density: 


+ The factor 1/5? in front of the exponential is the first term in the expansion in powers of 1/ng. To find 
it we must consider the first two terms in the expansion simultaneously [which is justified within the limits 
of accuracy of (115.18)]. 
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pe = — (P5 —2Qfyei!*3, be = — C Qe 5 

a a a E 9 
" (115.20) 


1 
for p=, -«mn«l, 
3 n 


where C is a complex constant satisfying the condition |C| «1. The presence of a periodic 
factor in these expressions is entirely natural. For large n we are dealing with a perturbation 
whose spatial periodicity is determined by the large wave vector k = n/a. Such perturbations 
must propagate like sound waves with velocity 


TEE 
d(elc?) V3 
Correspondingly the time part of the phase is determined, as in geometrical acoustics, by 


the large integral f ku dt = m3. As we see, the amplitude of the relative change of 
density remains constant, while the amplitude of the perturbations of the metric itself 
decreases like a^? in the expanding universe. t 

Now we consider later stages of the expansion, when the matter is already so rarefied 
that we can neglect its pressure (p = 0). We shall limit ourselves to the case of small y, 
corresponding to that stage of the expansion when the radius a was still small compared to 
its present value, but the matter was already quite rarefied. 

For p = 0 and n < 1, we have a © agn?/2, and (115.15) takes the form: 


n? 


4 
A+- A — > (A+) =O, 
Nn 3 
2 


4 
uU Tw + (ài) — 0. 
Nn 3 


The solution of these equations is 


C 2 
À-c u-2C;, ine ase (E 426) 
15 n? 


Also calculating ôe/e by using (115.14) and (112.12), we find: 


2 
M = C\(PE+Q8)+ HCR pL gh) for n<5, 
n 


2 
hf = Ci pnp! — pa Ca (pp £) for l <n <i, (115.21) 
15 n n 
be (eor " em) Q. 
£ 30 m 


We see that óe/e contains terms that increase proportionally with a.f But if nn < 1,then 


t It is easy to verify that (for p = e/3) nn ~ L/A, where L~ulV kefc?. It is natural that the characteristic 
length L, which determines the behavior of perturbations with wave length à <a, contains only hydro- 
dynamic quantities—the matter density e/c? and the sound velocity u (and the gravitational constant X). 
We note that there is a growth of the perturbations when 42» L [in (115.19)]. 

1 A more detailed analysis taking into account the small pressure p(e) shows that the possibility of neglect- 


ing the pressure requires that one satisfy the condition ugn/c «& 1 (where u = cV dplde is the small sound 
velocity); it is easy to show that in this case also it coincides with the condition A/L > 1. Thus, growth of the 
perturbation always occurs if A/L 2» 1. 
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de/e does not become large even for n ~ l/n because of the condition C, < 1. If, however, 
nn > 1, then for 7 ~ 1 the relative change of density becomes of order C, 7’, while the 
smallness of the initial perturbation requires only that C,z?5à < 1. Thus, although the 
growth of the perturbation occurs slowly, nevertheless its total growth may be considerable, 
so that it becomes quite large. 

One can similarly treat perturbations of the second and third types listed above. But the 
laws for the damping of these perturbations can also be found without detailed calculations 
by starting from the following simple arguments. 

lf over a small region of the matter (with linear dimensions /) there is a rotational per- 
turbation with velocity ôv, the angular momentum of this region is ~ (e/c?)/> - 1: v. During 
the expansion of the universe / increases proportionally with a, while e decreases like a^? 
(in the case of p = 0) or like a^ * (for p = ¢/3). From the conservation of angular momentum, 
we have 


l 
dv = const for p = e/3,0v ~ for p = 0. (115.22) 


Finally, the energy density of gravitational waves must decrease during the expansion 
of the universe like a^ *. On the other hand, this density is expressed in terms of the pertur- 
bation of the metric by ~ k?(h7)?, where k = n/a is the wave vector of the perturbation. 
It then follows that the amplitude of perturbations of the type of gravitational waves 
decreases with time like 1/a. 


$ 116. Homogeneous spaces 


The assumption of homogeneity and isotropy of space determines the metric completely 
(leaving free only the sign of the curvature). Considerably more freedom is left if one assumes 
only homogeneity of space, with no additional symmetry. Let us see what metric properties 
a homogeneous space can have. 

We shall be discussing the metric of a space at a given instant of time t. We assume that 
the space-time reference system is chosen to be synchronous, so that f is the same synchro- 
nized time for the whole space. 

Homogeneity implies identical metric properties at all points of the space. An exact 
definition of this concept involves considering sets of coordinate transformations that trans- 
form the space into itself, i.e. leave its metric unchanged: if the line element before trans- 
formation ts 

dl? = yyp(x', x7, x?) dx* dx’, 


then after transformation the same line element is 
d? = Ya. x7, x°) dx" dx'^, 


with the same functional dependence of the y,, on the new coordinates. A space is homo- 
geneous if it admits a set of transformations (a group of motions) that enables us to bring 
any given point to the position of any other point. Since space is three-dimensional the 
different transformations of the group are labelled by three independent parameters. 
Thus, in euclidean space the homogeneity of space is expressed by the invariance of the 
metric under parallel displacements (translations) of the cartesian coordinate system. Each 
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translation is determined by three parameters—the components of the displacement vector 
of the coordinate origin. All these transformations leave invariant the three independent 
differentials (dx, dy, dz) from which the line element is constructed. 

In the general case of a noneuclidean homogeneous space, the transformations of its 
group of motions again leave invariant three independent linear differential forms, which 
do not, however, reduce to total differentials of any coordinate functions. We write these 
forms as ae 

et? dx*, (116.1) 
where the Latin index (a) labels three independent vectors (coordinate functions); we call 
these vectors a frame. 

Using the forms (116.1) we construct a spatial metric invariant under the given group of 
motions: 

dl? = net? dx*Y(efP dx), (116.2) 
i.e. the metric tensor is 
Ja fact epo. (116.3) 
where the coefficients na, which are symmetric in the indices a and b, are functions of the 
time. f 

Thus we arrive at a “triad” representation of the spatial metric using a triple of coordi- 
nate vectors; all the formulas obtained in § 98 are applicable to this representation. The 
choice of basis vectors is dictated by the symmetry properties of the space and, in general, 
these basis vectors are not orthogonal (so that the matrix na, is not diagonal). 

As in § 98, along with the triple of vectors e(^? we introduce the reciprocal triple of vectors 
€(ay: for which 

Cate ED. aes = 0p. (116.4) 


In the three-dimensional case, the relation between the two vector triples can be written 
explicitly: 
— ay, ay | (a) a) l oD 
Ce X. ay ci E xe emne xe’, (116.5) 


where 

p = fe} = e ex en, 
and e) and e^ should be regarded as cartesian vectors with components e%,, and e? 
respectively. The determinant of the metric tensor (116.3) is 


2 


y = m, (116.6) 


where 7 is the. determinant of the matrix 5. 
The invariance of the differential forms (116.1) means that 


ex) dx* = ef? (x') dx", (116.7) 
where the e?) on the two sides of the equation are the same functions of the old and new 
B 
coordinates, respectively. Multiplying this equation by ef,,(x’), setting dx’? = a dx*, 
x 
and comparing coefficients of the same differentials dx’, we find 
ôx”? ] 
= eb (x')et?Xx). (116.8) 


ôx” 
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These equations are a system of differential equations that determine the functions x’? (x) 
for a given frame.t In order to be integrable, the equations (116.8) must satisfy identically 
the conditions 
9?x'? QAx'? 
Ox'Ox! | Ox'üx* 
Calculating the derivatives, we find 
e^, (x') efx) 5, a „ (dex) dex) 
| AV eip (x) Avi ex ) e (x)et (x) = ef (x ) an ^ ov of 


Multiplying both sides of the equations by e(,)(x)e?.)(x)ey’ (x) and shifting the differentia- 
tion from one factor to the other by using (116.4), we get for the left side: 


e? N 
«Poo RED en - ED n] = doene [EEREN 


ax’? €(c) ax 75 ox? 

and for the right, the same expression in the variable x. Since x and x' are arbitrary, these 
expressions must reduce to constants: 

($4 - def 


ax®  ax® ) Care) = C a (116.9) 


The constants C*,, are called the structure constants of the group. Multiplying by ey., we can 
rewrite (116.9) in the form 


Ge} oet, [4 
ay o ebb) ag = Cato (116.10) 


These are the required conditions for homogeneity of the space. The expression on the 
left side of (116.9) coincides with the definition of the quantities 4^,, (98.10), which are 
therefore constants. 

As we see from their definition, the structure constants are antisymmetric in their lower 
indices: 

Cu = — Cy. (116.11) 
We can obtain still another condition on them by noting that (116.10) can be written in the 
form of commutation relations 


[Xa X,] = X,X, — X,X, = Cp X, (116.12) 
for the linear differential operators} 
ð 
Xa = Cla za (116.13) 


'? — x? .-£, where the £^ are small quantities, we obtain from 


t For a transformation of the form x 
(116.8) the equations 
e" = E Beia e. 
ox" Ox” * 
The three linearly independent solutions of these equations, f(b = 1,2, 3), determine the infinitesimal 
transformations of the group of motions of the space. The vectors gy are called the Killing vectors. (Cf. 


the footnote on p. 271). 

+ In the mathematical theory of continuous groups (Lie groups) the operators X, satisfying conditions 
of the form (116.12) are called the generators of the group. We mention, however (to avoid confusion when 
comparing with other presentations), that the systematic theory usually starts from operators defined using 
the Killing vectors: 


(116.8a) 


ô 
X, = f ox’ 
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Then the condition mentioned above follows from the identity 
[[X.. X,]. X] [[X.. X), XJ+ [[X.. Xa), X,] -0 
(the Jacobi identity), and has the form: 


Ct C CV CS C Chey = O. (116.14) 


It is a definite advantage to use, in place of the three-index constants C;, a set of two- 
index quantities, obtained by the dual transformation 


Coon eus CP. (116.15) 


where e,,, = e^* is the unit antisymmetric symbol (with e, 33 = +1). With these constants 
the commutation relations (116.12) are written as 


e X, X. = CX, (116.16) 


The property (116.11) is already taken into account in the definition (116.15), while pro- 
perty (116.14) takes the form 


epea C CP = 0. (116.17) 


We also mention that the definition (116.9) for the quantities C^" can be written in vector 
form: 


l 
C? = lem. V xe, (116.18) 
" 


where the vector operations are again carried out as if the coordinates x^ were cartesian. 
The choice of the three frame vectors in the differential forms (116.1) (and with them the 


operators X,) is, of course, not unique. They can be subjected to any linear transformation 
with constant coefficients: 


Ca) = Anes): l (116.19) 


The quantities na, and C^^ behave like tensors with respect to such transformations. 

The conditions (116.17) are the only ones that the structure constants must satisfy. 
But among the constants admissible by these conditions, there are equivalent sets, in the 
sense that their difference is related to a transformation of the type (116.19). The question 
of the classification of homogeneous spaces reduces to determining all nonequivalent sets 
of structure constants. This can be done, using the “tensor” properties of the quantities 
C”, by the following simple method (C. G. Behr, 1962). 

The unsymmetric “tensor” C* can be resolved into a symmetric and an antisymmetric 
part. We denote the first by n^", and we express the second in terms of its “dual vector" a,: 


C^ = n+ ea (116.20) 
Substitution of this expression in (116.17) leads to the condition 
na, = 0. (116.21) 


By means of the transformations (116.19) the symmetric "tensor" n® can be brought to 
diagonal form: let n, n;, n, be its eigenvalues. Equation (116.21) shows that the "vector" 
a, (if it exists) lies along one of the principal directions of the "tensor" 5^5, the one corres- 
ponding to the eigenvalue zero. Without loss of generality we can therefore set a, = (a, 0, 0). 
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Then (116.21) reduces to an, = 0, i.e. one of the quantities a or n, must be zero. The com- 
mutation relations (116.12) take the form: 


[Xi Xi] = -aX,4+n, X3, 

[Xn X] =n, X, (116.22) 

[X5. Xi) = n, X,+ax3. 
The only remaining freedom is a change of sign of the operators X, and arbitrary scale 
transformations of them (multiplication by constants). This permits us simultaneously to 
change the sign of all the n, and also to make the quantity a positive (if it is different from 
zero). We can also make all the structure constants equal to +1, if at least one of the 
quantities a, n}, n, vanishes. But if all three of these quantities differ from zero, the scale 
transformations leave invariant the ratio a?/n,n,.t 

Thus we arrive at the following list of possible types of homogeneous spaces; in the first 


column of the table we give the roman numeral by which the type is labelled according to 
the Bianchi classification (L. Bianchi, 1918):t 


(1) n? (2) 


Type 


e 
3 
3 


I 

II 
VII, 
VI, 
IX 


| 
—oo-—-——oo 


S —-—oooooccoc 
—_—— O — OOOO 


VII, 
III (a — 1) 
VI, (a x 1) 


o oocoo-—-—-—-—o 


2 
! 


Type I is euclidean space; all components of the spatial curvature tensor vanish (cf. 
formula (116.24) below). In addition to the trivial case of a galilean metric, this also includes 
the time-dependent metric that will be considered in the next section. 

Type IX contains, as a special case, the space of constant positive curvature. It is obtained 
if, in the line element (116.2), we set na, = Ôa/4A, where A is a positive constant. In fact, 
calculating from (116.24) with C!! = C?? = C?? = | (the structure constants for type IX) 
gives Pras) = 46,5 and so i 

Pap = Piom e ep? = 2åYap» 


which just corresponds to such a space (ef. (111.3)]. 
In analogous fashion the space of constant negative curvature is contained as a special 


+ Strictly speaking, to conform to the “tensor” properties of the C°? we should introduce a factor 7 
in the definition (116.15) (cf. the remarks in § 83, which describe how the antisymmetric unit tensor must be 
defined with respect to arbitrary transformations of coordinates). We shall not, however, enter on these 
details here: for our purpose we can extract the law of transformation of the structure constants directly 
from eqs. (116.22). 

+ The parameter a runs through all positive values. The corresponding types are actually one-parameter 
families of different groups. Their assignment to types V1 and VII is a matter of convention. 
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case in type V. Setting na = 5,,/A and calculating P aya from (116.24) with C? = —C*? = 1 
we get 


, 


Piao = — 26,5, Pag = ~ 2Ayags 


which corresponds to a constant negative curvature. 

Finally, we show how the Einstein equations for a universe with a homogeneous space 
reduce to a system of ordinary differential equations containing only functions of the time. 
To do this we must resolve the spatial components of four-vectors and four- tensors along 
the triad of basis vectors of the space: 


(a) (a) 


as B m a = 
Rayo, = Rap ayo)» Roay = Roata U u“ ez 


where all these quantities are now functions of t alone; the scalar quantities, the energy 
density £ and the pressure of the matter p, are also functions of the time. 
According to (97.11-13), the Einstein equations in the synchronous frame are expressed 
in terms of three-dimensional tensors x,, and P,,. For the first we have simply 
X(ay(5) = Nav: x = n, ^ (116.23) 


(the dot denotes differentiation with respect to t). The components of P,a) can be expressed 
in terms of the quantities 7,, and the structure constants of the group by using (98.14). 
After replacing the three-index symbols 27. = Cz, by two-index symbols C^ and various 
transformationst we get: 


1 
Pia} = 2n Qc Caa + c^ Cau c^ Cua = CLC’; + C.) + 


Fé (C4)? — 20% C,,]). (116.24) 
Here, in accordance with the general rule, 
cy = Nac G^, Ca = Plac Noa ce. 


We also note that the Bianchi identities for the three-dimensional tensor P,, in the homo- 
geneous space take the form 


P*, Ca HPC a = 0. (116.25) 
The final expressions for the triad components of the Ricci four-tensor are: 
R$ = — Pid) —bel x's, 


Ria) aa —4x5}(C?.g — 58 Chie), 
PR 
RO = -ig o" xÈ) — pb), (116.26) 


We emphasize that in setting up the Einstein equations there is thus no need to use explicit 
expressions for the basis vectors as functions of the coordinates. 


t In which we use the formulas 


Naa We 1c; e*t! = Nar © et — à, oo: òp- 


abf 


t The covariant derivatives x^, 


footnote on p. 293. 


which enter in R2, are transformed using the formulas derived in the 
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§ 117. The flat anisotropic model 


The adequacy of the isotropic model for the description of the later stages of evolution 
of the universe is in itself no reason for expecting that it will be equally suited for the 
description of early stages of the evolution, near the time singularity. This question will 
be discussed in detail in § 119, but in this and the following sections we shall, as a prelimi- 
nary, consider solutions of the Einstein equations that also have a time singularity, but 
of an essentially different type from the Friedmann singularity. 

We shall look for solutions in which, for a suitable choice of reference frame, all compo- 
nents of the metric tensor are functions of a single variable, the time x? = t.t Such a 
question was already considered in § 109, where, however, we treated only the case when 
the determinant |g,,| = 0. As was shown in § 109, in such a case, we can, without loss of 
generality, set all the go, = 0. 

By a transformation of the variable t according to \/go9 dt-»dt, we can then make goo 
equal to unity, so that we obtain a synchronous reference system, in which 

Goo = 1, Joa = 0, gag = — Yalt). (117.1) 

We can now use the Einstein equations of gravitation in the form (97.11)-(97.13). Since 
the quantities y,,, and with them the components of the three-dimensional tensor Kag = Yap, 
do not depend on the coordinates x^, Ro, 0. For the same reason, P540, and as a result 
the equations of the gravitational field in vacuum reduce to the following system: 


X. ixixg = 0, (117.2) 
1 
— (Vy x8) = 0 (117.3) 
vy 
From equation (117.3) it follows that 
Vy xf = 248, (117.4) 
where the 4? are constants. Contracting on the indices « and fl, we then obtain 
y 2 
Mga b= A 
Y v7 


from which we see that y = const: t?. Without loss of generality we may set the constant 
equal to unity (simply by a scale change of the coordinates x?); then Aj = |. Substitution 
of (117.4) into equation (117.2) now gives the relation 


MAS = 1 (117.5) 


which relates the constants Àf. 
Next we lower the index f! in equations (117.4) and rewrite them as a system of ordinary 
differential equations: 


2 
jap = 1 Ai Yyp- (117.6) 


The set of coefficients A? may be regarded as the matrix of some linear substitution. By a 
suitable linear transformation of the coordinates x!, x?, x? (or, what is equivalent, of g; p 


f To simplify writing of formulas, we set c — 1 in $8 117-118. 
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926» 935), We can in general bring this matrix to diagonal form. We shall denote its principal 
values (roots of the characteristic equation) by p,, p;, p3, and assume that they are all real 
and distinct (concerning other cases, cf. below); the unit vectors along the corresponding 
principal axes are n(!, n? and nO. Then the solution of equations (117.6) can be written 
in the form 


Yap = PPM nj? + EPn 4 Pn n>? (117.7) 


(where the coefficients of the powers of t have been made equal to unity by a suitable scale 
change of the coordinates). Finally, choosing the directions of the vectors n), n?), n? as 
the directions of our axes (we call them x, y, z), we bring the metric to the final form 
(E. Kasner, 1922): 


ds? = di? —1??! dx? iP dy? —1??3 qz? (117.8) 
Here p,, p; and p, are any three numbers satisfying the two relations 
PitPotp3=1, pitpit+p3=1 (117.9) 


{the first of these follows from —g = 17, and the second—from (117.5). 

The three numbers p,, pa and p, obviously cannot all have the same value. The case 
where two of them are equal occurs for the triples 0,0,] and — 1/3, 2/3, 2/3. In all other cases 
the numbers p,, p; and p, are all different, one of them being negative and the other two 
positive. If we arrange them in the order p, < p; < p,, their values will lie in the intervals 


—-ixp,«0, O<p,<#, 2jixp«xl (117.10) 


Thus the metric (117.8) corresponds to a flat homogeneous but anisotropic space whose 
total volume increases (with increasing t) proportionally to t; the linear distances along two 
of the axes (y and z) increase, while they decrease along the third axis (x). The moment 
t = Q is a singular point of the solution; at this point the metric has a singularity which 
cannot be eliminated by any transformation of the reference system where the invariants 
of the four-dimensional curvature tensor vanish at infinity. T 

The metric (117.8) is an exact solution of the Einstein equations for empty space. But 
near the singular point, for small t, it remains an approximate solution (to terms of highest 
order in 1/t) even for the case of matter uniformly distributed in space. How and how fast 
the matter density changes is then determined simply by its equations of motion in the 
given gravitational field, while the influence of the matter back on the field is negligible. 
As too, the matter density tends to infinity—in accordance with the physical character 
of the singularity (cf. problem 3). 


T The only exception is the case where p, = pa = 0, pa = |. For these values we simply have a flat 
space-time; by the transformation ¢ sinh z = (C, f cosh z = 7 we can bring the metric (117.8) to galilean 
form. We also make reference to a paper that gives a variety of exact solutions of various types for the 
Einstein equations in vacuum, depending on a larger number of variables. B. K. Harrison, Phys. Rev. 116, 
1285 (1959). 

A solution of the type of (117.8) also exists in the case where the parameter is spacelike; we need only 
make the appropriate changes of sign, for example, 


ds? = x?’ : dp —dyx1— x1? 3 dy?—x?” > dz?. 
However, in this case there also exist solutions of another type, which occur when the characteristic equation 
of the matrix A in equations (117.6) has complex or coincident roots (cf. problems 1 and 2). For the case of 


a timelike parameter 7, these solutions are not possible, since the determinant g in them would not satisfy 
the necessary condition g <0. 
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PROBLEMS 


1. Find the solution of equations (117.6) corresponding to the case where the characteristic 
equation of the matrix 2% has one real (p3) and two complex (pi, 2 == p’+ ip”) roots. 

Solution: In this case the parameter x°, on which all the quantities depend, must have spacelike 
character; we denote it by x. Correspondingly, we must now have goo = —1 in (117.1). Equations 
(117.2-3) are not changed. - 

The vectors n'?, n in (117.7) become complex: n: 2 = (n'+in”)/ V2, where n’, n^ are unit 
vectors. Choosing the axes x!, x?, x? along the directions n', n^, n9, we obtain the solution in the 
form 


, " x poe ” x 
—911 = o2 = x?’ cos (2 In Ji gi» = — x?” sin (2p In z), 


933 = — X73, =g = —go0]gas| =x, 
where a is a constant (which can no longer be eliminated by a scale change along the x axis, without 
changing other coefficients in the expressions given). The numbers, pi, ps, pa again satisfy the rela- 
tions (117.9), where the real number p, is either > —1°3 or >l. 
2. Do the same for the case where two of the roots coincide (pa = ps). 
Solution: We know from the general theory of linear differential equations that in this case the 
system (117.6) can be brought to the following canonical form: 


2 . 2 . 2 A 
P ous Goa = P gaa, gaa = Z2 gaat ^ gae, a=2, 3, 
x x x x 


where 4 is a constant. If 4 — 0, we return to (117.8). If 4 #0, we can put 4 = 1; then 


Gu = 


gii PPE ga, a x?" ga, = bx??? +a. x??? In x. 


From the condition g2; = g32, we find that a; = 0, a, := bı. By an appropriate choice of scale 
along the x? and x^ axes, we finally bring the metric to the following form: 
ds? = -- dx? -= x? (dx!) — 2x??? dx? dx? +x?’ in xa (dx?)?. 

The numbers pı, p: can have the values 1,0 or — [,.3, 2/3. 

3. In the neighborhood of the singular point ¢ = O find the law of variation with time of the 
density of matter distributed uniformly in the space with metric (117.8). 

Solution: Neglecting the back influence of the matter on the field, we start from the hydrodynamic 
equations of motion 


1 č E 
yn 6 —— G ' = 0, 
Vy Ox (v g ou) 
ĉu, gi Óp k Op 
(pent (rcu EE) = — Se at B a 


contained in the equations Ti, = O (cf. Fluid Mechanics, § 125). Here a is the entropy density; 
near the singularity we must use the ultrarelativistic equation of state, p = ¢/3, and then o —?/*, 

We denote the time factors in (117.8) by a = (^, b = ("*, c = f^^, Since all quantities depend 
only on the time, and V/-g - abc, eqs. (1) give 


d du de 
qg (abono =0, 4- dr ur = 
Then 
abcuge>'* = const, (2) 
u,e'/* = const. (3) 


According to (3) all the covariant components «, are of the same order of magnitude. Of the contra- 
variant components, the largest (for *--0) is u? = u3/c?. Keeping only the largest terms in the 
identity s, = 1, we therefore get u2 uae. = (u3)?/c?, and then, from (2) and (3): 


1 


f 
~, u,~Vab, 
ab? z 


" 


Or 
pO MPa PD. = 7720r u, conn, (4) 
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As it should, e tends to infinity as r—0 for all values of ps except pa = 1, in accordance with the 
fact that a singularity in the metric with exponents (0, 0, 1) is not physical. 

The validity of the approximations used is verified by estimating the components 7* that were 
dropped on the right sides of Eqs. (117.2-3). Their main terms are: 


T? ~euz mt 0 tm, Tip n 
T? eu gu? Tta T? ~eut? mtt, 


They all actually increase more slowly as z->0 than the left sides of the equations, which grow like 
(72, 


§ 118. Oscillating regime of approach to a singular point 


Using the model of a universe with a homogeneous space of type IX we shall study the 
time singularity of the metric which has oscillatory character (V. A. Belinskii, E. M. Lifshitz, 
I. M. Khalatnikov, 1968). We shall see in the next section that such a situation has a very 
general significance. 

We shall be interested in the behavior of the model near the singularity (which we 
choose as the time origin t = 0). As in the Kasner solution treated in § 117, the presence: 
of matter does not affect the qualitative properties of this behavior. For simplicity we shall 
therefore assume at first that the space is empty. 

We take the quantities a(t) in (116.3) to be diagonal, denoting the diagonal elements by 
a^, b^, c^; we here denote the three frame vectors e!, e?, e? by 1, m, n. Then the spatial 
metric is written as: 

Yap = 271, lg - b^ m, m, 4 c?n,n,. (118.1) 
For a space of type IX the structure constants are:t 
C= C= CH] (118.2) 
(and C',, = C^,, = C*,, = 1). 


From (116.26) it can be seen that for these constants and a diagonal matrix y,,, the compo- 
nents Re) of the Ricci tensor vanish identically in the synchronous reference system. 
According to (116.26), the nondiagonal components Pias) also vanish. The remaining 
components of the Einstein equations give the following system of equations for the func- 
tions a, b, c: 


(abc): l 

abc = 2a?b?c? [(b* - c)? 7-4) 
(abe): l 

abc = 2a?b?c? [(a? - c?)* — &*], (118.3) 
(abe): _ 


] 
LIC (a? — b*)? — c*]. 
abc 2a? p?c? 1 
+ The frame vectors corresponding to these constants are: 
] = (sin x?, — cos x? sin x?, 0), m = (cos x°, sin x? sin x!, 0), n = (0, cos x! , 1). 


The coordinates run through values in the ranges O«zx! x s, Oz x?« 2m, O« x? «4m. The space is closed, 
and its volume 


V = (V y dx'dx!dx? = abc Í sin x'dx!dx?dx? = l6r?abc. 
When a — b — c it goes over into a space of constant positive curvature with radius of curvature 2a. 
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d + b + d 0 118.4 
a b e ` (118.4) 
[Equations (118.3) are the equation set R}, = Rà, = Ris, = 0; equation (118.4) is the 
equation RÌ = 0. 
The time derivatives in the system (118.3-4) take on a simpler form if we introduce in 
place of the functions a, b, c, their logarithms a, f, y: 


a =e", b=, c - e, (118.5) 


and in place of f, the variable z: 


dt = abc dt. (118.6) 
Then: 

2x... = (b? —-cy — a*. 

2B. = (dy -b. (118.7) 


27,4, = (a? - b? - c*; 
Not f Ey), m Abat V+ bY (118.8) 


where the subscript, t denotes differentiation with respect to t. Adding equations (118.7) 
and replacing the sum of second derivatives on the left by (118.8). we obtain: 


x reae Buy. = Hatt b c*-2a?b? —2a?c? - 2b?c?). (118.9) 


This relation contains only first derivatives, and is a first integral of the equations (118.7). 
Equations (118.3-4) cannot be solved exactly in analytic form, but permit a detailed 
qualitative study in the neighborhood of the singular point. 
We note first that if the right sides of equations (118.3) or (118.7) were absent, the system 
would have an exact solution, in which 


a~ AR bw Pm, C~ t, (1 18.10) 
where Pi, Pm and p, are numbers connected by the relations 
Pit Pmt Pn = Pi + Pmt Pa = 1 (118.11) 


[the analog of the Kasner solution (117.8) for a homogeneous flat space). We have denoted 
the exponents by Pı, Pms Pa» Without assuming any order of their size; we shall retain the 
notation p,, Pa, Pa of $117 for the triple of numbers arranged in the order p, «p; «p; 
and taking on values in the intervals (117.10) respectively. These numbers can be written 
in parametric form as 


—u ictu _ u(l +u) 


yugu? PROS yy POOS yugur OISID 


piu) = 


All the different values of the p,, P2, p, (preserving the assumed order) are obtained if the 
parameter u runs through values in the range uz |. The values u« are reduced to this 
same region as follows: 


l l 1 
Py (=) = plu), p G) = pu). p, (=) = py). (118.13) 
u u u 


Figure 25 shows graphs of p,, p, and p, as functions of l/u. 
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We assume that the right sides of (118.7) are small over some time interval, so that they 
can be neglected and we have a "Kasner-like" regime (118.10). Such a situation cannot 
continue indefinitely as t+0, since some of the terms must be increasing. Thus, if the 
negative exponent belongs to the function a(t) (so that p, = p, <0), then the disruption 
of the Kasner regime arises from the terms a^; the other terms will drop off as t decreases. 

Keeping only these terms on the right of (118.7), we get the system of equations 


(oreet Bar 5 Yra dee (118.14) 


The solution of these equations should describe the evolution of the metric from the 
“initial” state,t in which it is described by (118.10) with a definite choice of exponents 
(with p, «0); let p, = Py, Pm = P2, p, = P3, so that 


a — (P, b510, ¢ = 1?3 


(the proportionality coefficients in these expressions can be set equal to unity without any 
loss of generality in the result obtained below). Since abc = t, t = In z+ const, the initial 
conditions for (118.14) are formulated in the form 


Ar = pe Br= pn Y. py 


The first of equations (118.14) has the form of the equation of one-dimensional motion 
of a particle in the field of an exponential potential wall, where « plays the role of the co- 
ordinate. In this analogy, to the initial Kasner regime there corresponds a free motion with 
constant velocity x, = p,. After reflection from the wall, the particle will again move freely 
with the opposite sign of the velocity: a, = — p,. We also note that from equations (118.14), 
a. +f, = const, and «,+y, = const, hence we find that $, and y, take the values 


P: pot2po Y. P3t+2py. 


t We recall that we are considering the evolution of the metric as f—>0, so that the initial conditions corres- 
pond to a later, and not to an earlier time. 
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Now determining a, f, y, and then 1, using (118.6), we find 


egt. enw elP2t 200. e! w el P3 * ?P0t 
tw elitr 
i.e. 
aw Pe b P Pm C~ P”, 
where 
yy = Pl p, = RTP) Q2 5-2 (118.15) 
1—2]|p,| m 1-2|pi|' " 1—2|p,| 


Thus the action of the perturbation results in the replacement of one Kasner regime 
by another, with the negative power of t shifting from one direction | to another m: if we 
had p, <0, then now p, <0. In the course of the shift the function alt) goes through a maxi- 
mum, and the function b(r) through a minimum: the quantity b(r) which previously de- 
creased, begins to increase, the rising function a(f) starts to drop, while c(t) continues to fall 
off. The perturbation itself [the terms a* in (118.7)] which were rising, begin to drop and are 
damped out. Further evolution of the perturbation leads in an analogous way to increase 
of the perturbation due to the terms b* in (118.7), another shift of Kasner regime, etc. 

The rule for the shift of exponents (118.15) is conveniently represented by the parametriza- 
tion (118.12): if 


Pi = pilu), Pm = piu) Pa = pau). 
then 


pi = piu—l) Pa pí((u—l) pa = p3(u—1). (118.16) 


The larger of the two positive exponents remains positive. 

This process of shifting of Kasner regimes is the key to understanding the character of the 
evolution of the metric as we approach the singularity. 

The successive shifts (118.16) with bouncing of the negative exponent between the direc- 
tions I and m continues so long as the integral part of the initial value of u is not exhausted 
and u becomes less than unity. The value u < | is transformed into u> 1, according to(118.13); 
at that moment, either p, or p,, is negative, while p, becomes the smaller of the two positive 
numbers (p, = p;). The following series of shifts will now bounce the negative exponent 
between the directions n and I or between n and m. For an arbitrary (irrational) initial value 
of u, the process of shifting continues without end. 

In an exact solution the exponents p,, p; and p, will, of course, lose their literal meaning. 
We note that the smearing that this produces in the definition of these numbers (and with 
them, the parameter u) although small, makes it meaningless to consider any special (e.g. 
rational) values of u. This is why it is only meaningful to consider those regularities that are 
typical of the general case with irrational u, 

Thus the process of evolution of the model toward the singular point is made up of 
successive series of oscillations, during which the distances along two of the space axes 
oscillate, while they fall off monotonically along the third; the volume drops off according 
to a law that is close to ~£. In going from one series to the next, the direction along which 
there is a monotonic dropoff of distances shifts from one direction to another. Asymptoti- 
cally the order of these shifts takes on the character of a random process. The order of 
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succession of lengths of successive series of oscillations (i.e. the number of Kasner epochs 
in each series)t also takes on stochastic character. 

The successive series of oscillations crowd together as we approach the singularity. An 
infinite number of oscillations are contained between any finite world time ¢ and the mo- 
ment ¢ = 0. The natural variable for describing the time behavior of this evolution is 
not the time 7, but its logarithm, In z, in terms of which the whole process of approach 
to the singular point is spread out to — oc. 

In the solution described here we simplified the problem from the very start, assuming 
that the matrix 7,,(7) in (116.3) is diagonal. The inclusion of nondiagonal components of na» 
does not change the oscillatory character of the evolution of the metric or the law (118.16) 
for the shift of the exponents p,, p,, p, of successive Kasner epochs. It leads, however, 
to the appearance of an additional property: the shifting of the exponents is also accom- 
panied by a change in the directions of the axes to which the exponents apply.t 


$ 119. The time singularity in the general cosmological solution of the Einstein equations 


It has already been pointed out that the adequacy of the Friedmann model for describing 
the present state of the Universe is no basis for expecting that it is equally suitable for 
describing its early stages of evolution. One may even ask to what extent the existence of a 
time singularity is a necessary general property of cosmological models, or whether it is 
really caused by the specific simplifying assumptions on which the models are based (in 
particular, symmetry assumptions). We emphasize that when we speak of a singularity, 
we have in mind a physical singularity, a place where the density of matter and the invariants 
of the curvature tensor become infinite. 

If the presence of the singularity were independent of these assumptions, it would mean 
that it is inherent not only to special solutions, but also to the general solution of the 
Einstein equations. The criterion of generality of the solution is the number of "physically 
arbitrary" functions contained in it. In the general solution the number of such functions 
must be sufficient for arbitrary assignment of initial conditions at any chosen time [4 for 
empty space, 8 for space filled with matter (cf. § 95)].§ 


+ If the “initial” value of the parameter u is uo = ko-- xo (where ko is an integer and xo « 1) the length 


of the first series of oscillations will be ko, while the initial value for the next series will be u, = l/xo zk,-F- xi, 
etc. From this it is easy to conclude that the lengths of successive series will be given by the elements Ko, ki, 
kı... of the expansion of uo in an infinite (for irrational xo) continued fraction 
1 
to kor 
1 
ki 
l 
kı- 
k3— m 


The succession of values of further elements in such an expansion is distributed according to statistical laws. 

t Concerning this and other details of the behavior of homogeneous cosmological models of this type, 
cf. V. A. Belinskii, E. M. Lifshitz and I. M. Khalatnikov, Adr. in Physics 19, 525 (1970); Adv. in Physics. 31, 
639 (1982). 

$ We emphasize that for a system of nonlinear equations, such as the Einstein equations, the notion of a 
general solution is not unambiguous. In principle more than one general integral may exist, each of the 
integrals covering not the entire manifold of conceivable initial conditions, but only some finite part of it. 
The existence of a general solution possessing a singularity does not therefore preclude the existence of 
other general solutions that do not have a singularity. For example, there is no reason to doubt the existence 
of a general solution, without singularities, that describes a stable, isolated body with not too large mass. 
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Finding such a solution in exact form, for all space and over all time, is clearly impossible. 
But to solve our problem it is sufficient to study the form of the solution only near the 
singularity. 

The singularity of the Friedmann solution is characterized by the fact that the vanishing 
of spatial distances occurs according to the same law in all directions. This type of singula- 
rity is not sufficiently general: it is typical of a class of solutions that contain only three 
physically arbitrary coordinate functions (cf. the problem in § 113). We also note that these 
solutions exist only for a space filled with matter. 

The singularity of oscillatory type, considered in the preceding section, is general in 
character—there exists a solution of the Einstein equations with such a singularity and 
containing the required set of arbitrary functions. We shall briefly describe the way to 
construct such a solution, without going into the details of the calculation.t 

As in the homogeneous model (§ 118) the regime of approach to the singularity consists 
of successive series of Kasner epochs replacing one another. In the course of each epoch, 
the main terms (in 1/r) in the spatial metric tensor (in the synchronous reference frame) 
have the form (118.1) with functions of the time a, b, c from (118.10), but the vectors 
l, m, n are now arbitrary functions of the space coordinates (and not definite functions, 
as in the homogeneous model). Now the p,, Pms p, are also functions (and not simply 
numbers), again related by the formulas (118.11). The metric constructed in this way satisfies 
the equations Rj = 0 and AR = 0 for the vacuum field (at least, their leading terms) over 
a certain finite time interval. The equations R? = 0 give rise to three relations (not contain- 
ing the time) that must be imposed on the arbitrary functions of the space coordinates 
that are contained in the y,,. These relations connect ten different functions: three compo- 
nents of each of the three vectors I, m, n and one function of the time exponents (since the 
functions p;, Pm» p, are connected by two conditions (118.11). In determining the number of 
physically arbitrary functions we must also consider that the synchronous reference frame 
still admits transformations of the three spatial coordinates that do not affect the time. 
Thus the metric contains altogether 10—3—3 — 4 arbitrary functions—just the number 
one should have in the general solution for the field in vacuum. 

The replacement of one Kasner regime by another occurs (just as in the homogeneous 
model) because of the presence in three of the six equations R? = 0 of terms which, with 
decreasing t, increase more rapidly than the others, thus playing the role of a perturba- 
tion destroying the Kasner regime. These equations, in the general case, have a form differing 
from (118.14) only in a factor depending on the space coordinates (l-curl l/I-m x n) on 
their right sides (where it is understood that of the three exponents p, Pm» Pas p, is negative). t 
Since, however, eqs. (118.14) are a system of ordinary differential equations with respect 
to the time, this difference in no way affects their solution or the law that follows from 
this solution concerning the shift of Kasner exponents (118.16), or any of the further 
conclusions presented in 8118.8 

The degree of generality of the solution is not reduced if matter is introduced: the matter 


t They can be found in V. A. Belinskii, E. M. Lifshitz and I. M. Khalatnikov, Adv. in Physics. 31, 639 (1982). 

* For the homogeneous model, this factor coincides with the square of the structure constant C'!, and 
is constant by definition. 

i If we impose on the arbitrary functions in the solution the supplementary condition I-curli = 0, 
the oscillations disappear. and the Kasner regime will continue right up to the point r = 0. Such a solution, 
however, contains one function fewer than is required in the general case. 
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"inscribes" itself on the metric through all of the four new coordinate functions that are 
needed for assigning the initial distribution of its density and its three velocity components. 
The energy-momentum tensor 77 of the matter introduces terms in the field equations 
that are of higher order in t/z than the leading terms (in precise analogy to what was shown 
in problem 3 of § 117 for the plane homogeneous model). 

Thus the existence of a singular point in the time is an extremely general property of 
solutions of the Einstein equations, where the process of approach to the singular point in 
the general case is oscillatory in character. We emphasize that this character is not related 
to the presence of matter (and therefore not related to its equation of state), and is already 
typical of the empty space-time itself. The singularity of the monotonic, isotropic type, 
typical of the Friedmann solution. and dependent on the presence of matter, has only 
special significance. When we speak of singularities in the cosmological sense. we have 
in mind a singular point that 1s reached over all the space and not just over some limited 
part, as in the gravitational collapse of a finite body. But the generality of the oscillatory 
solution gives one a basis for assuming that the singularity reached by a finite body in its 
collapse below the event horizon in the comoving reference frame has this same character. 

We have continually spoken of the direction of approach to the singular point as the 
direction of decreasing time; but in view of the symmetry of the Einstein equations under 
time reversal, we could with equal justification speak of approach to the singularity in the 
direction. of increasing time. Actually, however, in view of the physical nonequivalence 
of future and past, there is an essential difference between these two cases in the very 
formulation of the question. A singularity in the future can have physical meaning only 
if it is reachable for arbitrary initial conditions assigned at some preceding moment of time. 
Clearly there is no reason why the distribution of matter and field that was reached at some 
moment in the process of evolution of the universe should have corresponded to the precise 
conditions required for the appearance of some particular solution of the Einstein equations. 

Concerning the question of the type of singularity in the past, an investigation based 
solely on some equations of gravitation can hardly give a general answer. It is natural to 
think that the choice of solution corresponding to the real world is related to some pro- 
found physical requirements, whose establishment on the basis of the existing theory of 
gravitation alone is Impossible, and whose explanation will come only as the result of a 
further synthesis of physical theories. In this sense it could in principle turn out that this 
choice corresponds to some particular (e.g. isotropic) type of singularity. 

Finally, it is necessary still to make the following remark. The domain of applicability 
of the Einstein equations in themselves is in no way limited in the region of small distances 
or large densities of matter in the sense that the equations in this limit do not lead to any 
internal contradictions (in contrast, for example, to the classical equations of electro- 
dynamics). In this sense, the investigation of the singularity of the space-time metric on 
the basis of the Einstein equations is entirely correct. There is no doubt, however, that in 
this limit quantum phenomena become important, and we can say nothing about them 
in the present state of the theory. Only in a future synthesis of the theory of gravitation 
and quantum theory will it become clear which of the results of classical theory remain 
meaningful. At the same time there is no doubt that the very fact of the appearance of a 


T The fact that a singular point exists in the general solution of the Einstein equations was first shown by 
R. Penrose, 1965, by topological! methods, which, however, do not enable one to establish the specific 
analytic character of the singularity. A presentation of these methods and the theorems obtained using them 
is given in R. Penrose, Structure of Space-Time, W. A. Benjamin, N.Y., 1968. 
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singularity in the solutions of the Einstein equations (both in their cosmological aspect 
and for the collapse of finite bodies) has a profound physical meaning. One must not forget 
that the achievement during gravitational collapse of densities so enormous and, yet, 


still undoubtedly correctly described by the classical theory of gravitation, is enough for 
us to speak of a physically "singular" phenomenon. 
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